Abstract. We consider the class of languages defined in the 2-variable fragment of the
first-order logic of the linear order. Many interesting characterizations of this class are 
known, as well as the fact that restricting the number of quantifier alternations yields an 
infinite hierarchy whose levels are varieties of languages (and hence admit an algebraic 
characterization). Using this algebraic approach, we show that the quantifier alternation 
hierarchy inside $\mathrm{FO}^2[<]$ is decidable within one unit. For this purpose, we relate each level
of the hierarchy with decidable varieties of languages, which can be defined in terms of 
iterated deterministic and co-deterministic products. A crucial notion in this process is 
that of condensed rankers, a refinement of the rankers of Weis and Immerman and the 
turtle languages of Schwentick, Thérien and Vollmer.

Many important properties of systems are modeled by finite automata. Frequently, the 
formal languages induced by these systems are definable in first-order logic. Our under- 
standing of its expressive power is of direct relevance for a number of application fields, 
such as verification. 

The first-order logic we are interested in, in this paper, is the first-order logic of the linear
order, written $\mathrm{FO}[<]$, interpreted on finite words. It is well-known that the languages that
are definable in this logic are exactly the star-free languages, or equivalently the regular
languages whose syntactic monoid is aperiodic (that is: satisfies an identity of the form
$x^{n+1} = x^n$ for some integer $n$) [MllIS] (see also O |20l [271 [29] ); and that deciding whether
a finite automaton accepts such a language is PSPACE-complete [3].

Fragments of first-order logic defined by the limitation of certain resources have been 
studied in detail. For instance, the quantifier alternation hierarchy, with its close relation 
with the dot-depth hierarchy of star-free languages, offers one of the oldest open problems 
in formal language theory: we know that the hierarchy is infinite and that its levels are 
characterized algebraically (by a property of the syntactic monoids), but we do not know
whether these levels (besides levels 0 and 1) are decidable. In contrast, it is known that the
quantifier alternation hierarchy for the first-order logic of the successor, $\mathrm{FO}[S]$, collapses at
level 2 [Sam].

Another natural limitation considers the number of variables in a formula. This lim- 
itation has attracted a good deal of attention, as the trade-off between formula size and 
number of variables is known to be related with the trade-off between parallel time and 
number of processes, see [HKHlS]. 

It is well-known that every first-order formula is equivalent to one using at most three 
variables. On the other hand, the first-order formulas using at most two variables, written
$\mathrm{FO}^2[<]$, are strictly less expressive. The class of languages defined by such formulas admits
many remarkable characterizations [31]. To begin with, a language is $\mathrm{FO}^2[<]$-definable if
and only if it is recognized by a monoid in the pseudovariety $\mathbf{DA}$ [33] (a precise definition
will be given in Section 2). As with the characterization of $\mathrm{FO}[<]$-definability by aperiodic
monoids, this characterization implies decidability. The $\mathrm{FO}^2[<]$-definable languages are also
characterized in terms of unambiguous products of languages (see Section 6.3) and in terms
of the unary fragment of propositional temporal logic [7] (see Section 1.3). For a survey of
these properties, the reader is referred to (HU |6]. 

In this paper, we consider the quantifier alternation hierarchy within the two-variable
fragment of first-order logic. We denote by $\mathrm{FO}^2_m[<]$ the fragment of $\mathrm{FO}^2[<]$ consisting of
formulas using at most 2 variables and at most $m$ alternating blocks of quantifiers. In the
sequel, we omit specifying the predicate $<$ and we write simply $\mathrm{FO}$, $\mathrm{FO}^2$ or $\mathrm{FO}^2_m$.

Schwentick, Thérien and Vollmer introduced the so-called turtle programs to characterize
the expressive power of $\mathrm{FO}^2$ [26]. These programs are sequences of directional instructions
of the form "go to the next $a$ to the right", "go to the next $b$ to the left". More details
can be found in Section 1.2 below. Turtle programs were then used, under the name of
rankers, by Weis and Immerman [37] (first published in [36]) to characterize $\mathrm{FO}^2_m$ in terms
of rankers with $m$ alternations of directions (right vs. left). Their subtle characterization,
Theorem 1.8 below, does not yield a decidability result. It forms however the basis of our
results.

Rankers are actually better suited to the study of a natural alternation hierarchy within
the unary fragment of propositional temporal logic (see Section 1.3) than to the study of
the quantifier alternation within $\mathrm{FO}^2[<]$. For the latter, we define the notion of condensed
rankers, which introduce a notion of efficiency in the path they describe in a word, see
Section 3.

Recent results of Kufleitner and Lauser [16] and Straubing [28] show that $\mathcal{FO}^2_m$ (the
set of $\mathrm{FO}^2_m$-definable languages) forms a variety of languages. We show that the classes
of languages defined by condensed rankers with at most $m$ changes of directions also form
varieties of languages, written $\mathcal{R}_m$ and $\mathcal{L}_m$ depending on whether the initial move is towards
the right or towards the left (Section 3.3). The meaning of these results is that membership
of a language $L$ in these classes depends only on the syntactic monoid of $L$. This justifies
using algebraic methods to approach the decidability problem for $\mathcal{FO}^2_m$, a technique that
has proved very useful in a number of situations (see for instance [20l IHH [30l [6]).

In fact, we use this algebraic approach to show that the classes $\mathcal{R}_m$ and $\mathcal{L}_m$ are decidable
(Section 13. Sp), and that they admit a neat characterization in terms of closure under
alternated deterministic and co-deterministic products (Section 3.4). Moreover, we show



LOGICAL HIERARCHIES WITHIN FO^-DEFINABLE LANGUAGES 



(Theorem [M]) that

$\mathcal{R}_m \cup \mathcal{L}_m \subseteq \mathcal{FO}^2_m \subseteq \mathcal{R}_{m+1} \cap \mathcal{L}_{m+1}$.

This shows that one can effectively compute, given a language $L$ in $\mathcal{FO}^2$, an integer $m$ such
that $L$ is in $\mathcal{FO}^2_{m+1}$, possibly in $\mathcal{FO}^2_m$, but not in $\mathcal{FO}^2_{m-1}$. That is, we can compute the
quantifier alternation depth of $L$ within one unit. As indicated above, this is much more
precise than the current level of knowledge on the general quantifier alternation hierarchy
in $\mathrm{FO}[<]$.

We conjecture that $\mathcal{FO}^2_m$ is actually equal to the intersection of $\mathcal{R}_{m+1}$ and $\mathcal{L}_{m+1}$. This
would prove that each $\mathcal{FO}^2_m$ is decidable.

Many of these results were announced in earlier work, with a few differences. In particular,
the definition of the sets $R^X_{m,n}$ (Section 1.2) in [13] introduced a mistake which is corrected
here. The proof of [14, Theorem 2] contained a gap: we do not have a proof that the classes
defined by the alternation hierarchy within unary temporal logic are varieties. And the
proof of [14, Proposition 2.9] also contained a gap: the correct statement is Theorem 4.3
below.

1. Rankers and logical hierarchies 

Let $A$ be a finite alphabet. We denote by $A^*$ the set of all words over $A$ (that is, of sequences
of elements of $A$), and by $A^+$ the set of non-empty words. If $u$ is a length $n$ ($n \ge 0$) word
over $A$, we say that an integer $1 \le i \le n$ is an $a$-position of $u$ if the $i$-th letter of $u$, written
$u[i]$, is an $a$. If $1 \le i \le j \le n$, we let $u[i;j]$ be the factor $u[i] \cdots u[j]$ of $u$.

$\mathrm{FO}$ denotes the set of first-order formulas using the unary predicates $\mathbf{a}$ ($a \in A$) and the
binary predicate $<$, and $\mathrm{FO}^2$ denotes the fragment of $\mathrm{FO}$ consisting of formulas which use
at most two variable symbols.

If $u$ is a length $n$ ($n \ge 0$) word over $A$, we identify the word $u$ with the logical structure
$(\{1, \ldots, n\}, (\mathbf{a})_{a \in A})$, where $\mathbf{a}$ denotes the set of $a$-positions in $u$. Formulas from $\mathrm{FO}$ are
naturally interpreted over this structure, and we denote by $L(\varphi)$ the language defined by
the formula $\varphi \in \mathrm{FO}$, that is, the set of all words which satisfy $\varphi$.

1.1. Quantifier-alternation within $\mathrm{FO}^2$. We now concentrate on $\mathrm{FO}^2$-formulas and we
define two important parameters concerning such formulas. To simplify matters, we consider
only formulas where negation is used only on atomic formulas so that, in particular, no
quantifier is negated. This is naturally possible up to logical equivalence. Now, with each
formula $\varphi \in \mathrm{FO}^2$, we associate in the natural way a parsing tree: each occurrence of a
quantification, $\exists x$ or $\forall x$, yields a unary node, each occurrence of $\vee$ or $\wedge$ yields a binary
node, and the leaves are labeled with atomic or negated atomic formulas. The quantifier
depth of $\varphi$ is the maximum number of quantifiers along a path in its parsing tree.

With each path from root to leaf in this parsing tree, we also associate its quantifier
label, which is the sequence of quantifier node labels ($\exists$ or $\forall$) encountered along this path.
A block in this quantifier label is a maximal factor consisting only of $\exists$ or only of $\forall$, and
we define the number of blocks of $\varphi$ to be the maximum number of blocks in the quantifier
label of a path in its parsing tree. Naturally, the quantifier depth of $\varphi$ is at least equal to
its number of blocks.
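
As an illustration, the two parameters can be computed directly from a parsing tree. The sketch below is not part of the original development: the tuple encoding of formulas and the function names are assumptions made for this example only.

```python
from typing import Iterator, Tuple

# Assumed encoding of a formula in negation normal form, as nested tuples:
#   ("atom", text), ("E", sub), ("A", sub), ("and", left, right), ("or", left, right)
Formula = Tuple

def quantifier_labels(phi: Formula) -> Iterator[str]:
    """Yield the quantifier label of every root-to-leaf path of the parsing tree."""
    kind = phi[0]
    if kind == "atom":
        yield ""
    elif kind in ("E", "A"):          # unary node: exists / for all
        for label in quantifier_labels(phi[1]):
            yield kind + label
    else:                             # binary node: "and" / "or"
        for child in phi[1:]:
            yield from quantifier_labels(child)

def count_blocks(label: str) -> int:
    """Number of maximal factors consisting only of E or only of A."""
    return sum(1 for i, q in enumerate(label) if i == 0 or q != label[i - 1])

def depth_and_blocks(phi: Formula) -> Tuple[int, int]:
    """Return (quantifier depth, number of blocks) of the formula."""
    labels = list(quantifier_labels(phi))
    return max(map(len, labels)), max(map(count_blocks, labels))

# Hypothetical formula: Ex ( Ay atom  and  Ey Ex atom ), with labels "EA" and "EEE".
phi = ("E", ("and", ("A", ("atom", "x < y")), ("E", ("E", ("atom", "a(x)")))))
print(depth_and_blocks(phi))  # (3, 2)
```

The path labels here are `EA` and `EEE`, so the depth is 3 and the number of blocks is 2, in line with the observation that the depth is at least the number of blocks.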

We let $\mathrm{FO}^2_{m,n}$ ($n \ge m$) denote the set of first-order formulas with quantifier depth at
most $n$ and with at most $m$ blocks and let $\mathrm{FO}^2_m$ denote the union of the $\mathrm{FO}^2_{m,n}$ for all $n$. We
also denote by $\mathcal{FO}^2$ ($\mathcal{FO}^2_m$, $\mathcal{FO}^2_{m,n}$) the class of $\mathrm{FO}^2$ ($\mathrm{FO}^2_m$, $\mathrm{FO}^2_{m,n}$)-definable languages.

Remark 1.1. Recall that a language is piecewise testable if it is a Boolean combination
of languages of the form $A^* a_1 A^* \cdots a_k A^*$ ($a_i \in A$). It is an elementary observation that
the piecewise testable languages coincide with $\mathcal{FO}^2_1$. It is well-known that this class of
languages is decidable (see Section 2 below).

1.2. Rankers. A ranker is a non-empty word on the alphabet $\{X_a, Y_a \mid a \in A\}$. Rankers
define positions in words: given a word $u \in A^+$ and a letter $a \in A$, we denote by $X_a(u)$
(resp. $Y_a(u)$) the least (resp. greatest) integer $1 \le i \le |u|$ such that $u[i] = a$. If $a$ does not
occur in $u$, we say that $Y_a(u)$ and $X_a(u)$ are not defined. If in addition $q$ is an integer such
that $0 \le q \le |u| + 1$, we let

$X_a(u, q) = q + X_a(u[q+1; |u|])$,

$Y_a(u, q) = Y_a(u[1; q-1])$.

These definitions are extended to all rankers: if $r'$ is a ranker, $Z \in \{X_a, Y_a \mid a \in A\}$ and
$r = r'Z$, we let

$r(u, q) = Z(u, r'(u, q))$

if $r'(u, q)$ and $Z(u, r'(u, q))$ are defined, and we say that $r(u, q)$ is undefined otherwise. In
particular: rankers are processed from left to right.

Finally, if $r$ starts with an X- (resp. Y-) letter, we say that $r$ defines the position
$r(u) = r(u, 0)$ (resp. $r(u) = r(u, |u| + 1)$), or that it is undefined on $u$ if this position does
not exist.

Remark 1.2. Rankers were first introduced, under the name of turtle programs, by Schwentick,
Thérien and Vollmer [26], as sequences of instructions: go to the next $a$ to the right, go
to the next $b$ to the left, etc. These authors write $(\rightarrow, a)$ and $(\leftarrow, a)$ instead of $X_a$ and $Y_a$.
Weis and Immerman [37] write $\triangleright_a$ and $\triangleleft_a$ instead, and they introduced the term ranker. We
rather follow the notation in [6l [l3l S], where X and Y refer to the future and past operators
of PTL.

Example 1.3. The ranker $X_a Y_b X_c$ (go to the first $a$ starting from the left, thence to the
first $b$ towards the left, thence to the first $c$ towards the right) is defined on $bac$ and $bca$,
but not on $abc$ or $cba$.
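
The definition of $r(u)$ can be sketched in a few lines of code, which also replays Example 1.3. This is only an illustration: the encoding of a ranker as a list of (direction, letter) pairs and the names `step` and `evaluate` are assumptions of this sketch.

```python
from typing import List, Optional, Tuple

# Assumed encoding: the ranker X_a Y_b X_c is [("X", "a"), ("Y", "b"), ("X", "c")].
Ranker = List[Tuple[str, str]]

def step(u: str, q: int, direction: str, a: str) -> Optional[int]:
    """Compute X_a(u, q) or Y_a(u, q); positions are 1-based, 0 and |u|+1 are borders."""
    if direction == "X":
        # least a-position strictly to the right of q
        for i in range(q + 1, len(u) + 1):
            if u[i - 1] == a:
                return i
    else:
        # greatest a-position strictly to the left of q
        for i in range(q - 1, 0, -1):
            if u[i - 1] == a:
                return i
    return None

def evaluate(r: Ranker, u: str) -> Optional[int]:
    """Return the position r(u), or None if r is undefined on u."""
    pos: Optional[int] = 0 if r[0][0] == "X" else len(u) + 1
    for direction, a in r:          # letters are processed from left to right
        pos = step(u, pos, direction, a)
        if pos is None:
            return None
    return pos

r = [("X", "a"), ("Y", "b"), ("X", "c")]
for word in ["bac", "bca", "abc", "cba"]:
    print(word, evaluate(r, word))
```

On `bac` and `bca` the ranker reaches positions 3 and 2 respectively, and it is undefined (`None`) on `abc` and `cba`.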

By $L(r)$ we denote the language of all words on which the ranker $r$ is defined. We say
that the words $u$ and $v$ agree on a class $R$ of rankers if exactly the same rankers from $R$
are defined on $u$ and $v$. And we say that two rankers $r$ and $s$ coincide on a word $u$ if they
are both defined on $u$ and $r(u) = s(u)$.

Example 1.4. If $r = X_{a_1} \cdots X_{a_k}$ (resp. $r = Y_{a_k} \cdots Y_{a_1}$), then $L(r)$ is the set of words that
contain $a_1 \cdots a_k$ as a subword, $L(r) = A^* a_1 A^* \cdots a_k A^*$.

The depth of a ranker $r$ is defined to be its length as a word. A block in $r$ is a maximal
factor in $\{X_a \mid a \in A\}^+$ (an X-block) or in $\{Y_a \mid a \in A\}^+$ (a Y-block). If $n \ge m$, we denote
by $R^X_{m,n}$ (resp. $R^Y_{m,n}$) the set of $m$-block, depth $n$ rankers, starting with an X- (resp. Y-)
block, and we let

$R_{m,n} = R^X_{m,n} \cup R^Y_{m,n}$,
$\underline{R}^X_{m,n} = \bigcup_{m' \le m,\, n' \le n} R^X_{m',n'} \;\cup\; \bigcup_{m' < m,\, n' < n} R^Y_{m',n'}$.

We define $\underline{R}^Y_{m,n}$ dually and we let $\underline{R}^X_m = \bigcup_{n \ge m} \underline{R}^X_{m,n}$, $\underline{R}^Y_m = \bigcup_{n \ge m} \underline{R}^Y_{m,n}$ and $\underline{R}_m = \underline{R}^X_m \cup \underline{R}^Y_m$.

Remark 1.5. Readers familiar with [37] will notice differences between our $\underline{R}^X_{m,n}$ and their
analogous $R_{m\triangleright,n}$; introduced for technical reasons, it creates no difference between our $\underline{R}_{m,n}$
and their $R^*_{m,n}$, the classes which intervene in the crucial Theorem 1.8 below.
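
Example 1.4 can be checked by brute force on short words. The following sketch, with function names chosen for this illustration only, compares definedness of a one-block ranker with the usual scattered-subword test.

```python
from itertools import product

def is_subword(w: str, u: str) -> bool:
    """True if w is a (scattered) subword of u."""
    it = iter(u)
    return all(c in it for c in w)

def x_block_defined(w: str, u: str) -> bool:
    """Is the ranker X_{a_1} ... X_{a_k} defined on u, where w = a_1 ... a_k?"""
    q = 0
    for a in w:
        q = u.find(a, q) + 1        # least a-position > q (1-based), or 0 if none
        if q == 0:
            return False
    return True

def y_block_defined(w: str, u: str) -> bool:
    """Is the ranker Y_{a_k} ... Y_{a_1} defined on u, where w = a_1 ... a_k?"""
    q = len(u) + 1
    for a in reversed(w):
        q = u.rfind(a, 0, q - 1) + 1  # greatest a-position < q (1-based), or 0 if none
        if q == 0:
            return False
    return True

# Exhaustive check over the alphabet {a, b} for short words.
words = ["".join(t) for n in range(0, 5) for t in product("ab", repeat=n)]
patterns = ["".join(t) for k in range(1, 4) for t in product("ab", repeat=k)]
ok = all(
    x_block_defined(w, u) == is_subword(w, u) == y_block_defined(w, u)
    for u in words for w in patterns
)
print(ok)
```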

1.3. Rankers and unary temporal logic. Let us depart for a moment from the consideration
of $\mathrm{FO}^2$-formulas, to observe that rankers are naturally suited to describe the
different levels of a natural class of temporal logic. The symbols $X_a$ and $Y_a$ ($a \in A$) can be
seen as modal (temporal) operators, with the future and past semantics respectively. We
denote the resulting temporal logic (known as unary temporal logic) by TL: its only atomic
formula is $\top$, the other formulas are built using Boolean connectives and modal operators.
Let $u \in A^*$ and let $0 \le i \le |u| + 1$. We say that $\top$ holds at every position $i$, $(u, i) \models \top$;
Boolean connectives are interpreted as usual; and $(u, i) \models X_a \varphi$ (resp. $Y_a \varphi$) if and only
if $(u, X_a(u, i)) \models \varphi$ (resp. $(u, Y_a(u, i)) \models \varphi$). We also say that $u \models X_a \varphi$ (resp. $Y_a \varphi$) if
$(u, 0) \models X_a \varphi$ (resp. $(u, 1 + |u|) \models Y_a \varphi$).

TL is a fragment of propositional temporal logic PTL; the latter is expressively equivalent
to $\mathrm{FO}$ and TL is expressively equivalent to $\mathrm{FO}^2$ [13].

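As in the case of $\mathrm{FO}^2$-formulas, one may consider the parsing tree of a TL-formula and
define inductively its depth and number of alternations (between past and future operators).
If $n \ge m$, the fragment $\mathrm{TL}^X_{m,n}$ (resp. $\mathrm{TL}^Y_{m,n}$) consists of the TL-formulas with depth $n$ and
with $m$ alternated blocks, in which every branch (of the parsing tree) with exactly $m$
alternations starts with future (resp. past) operators. Branches with fewer alternations may
start with past (resp. future) operators. The fragments $\mathrm{TL}_{m,n}$, $\underline{\mathrm{TL}}^X_{m,n}$, $\underline{\mathrm{TL}}^Y_{m,n}$, $\underline{\mathrm{TL}}^X_m$, $\underline{\mathrm{TL}}^Y_m$
and $\underline{\mathrm{TL}}_m$ are defined according to the same pattern as in the definition of $R_{m,n}$, $\underline{R}^X_{m,n}$, $\underline{R}^Y_{m,n}$,
$\underline{R}^X_m$, $\underline{R}^Y_m$ and $\underline{R}_m$. We also denote by $\mathcal{TL}^X_{m,n}$ ($\underline{\mathcal{TL}}^X_m$, $\underline{\mathcal{TL}}_m$, etc.) the class of $\mathrm{TL}^X_{m,n}$ ($\underline{\mathrm{TL}}^X_m$,
$\underline{\mathrm{TL}}_m$, etc.)-definable languages.

Proposition 1.6. Let $1 \le m \le n$. Two words satisfy the same $\mathrm{TL}^X_{m,n}$ formulas if and only
if they agree on rankers from $\underline{R}^X_{m,n}$. A language is in $\mathcal{TL}^X_{m,n}$ if and only if it is a Boolean
combination of languages of the form $L(r)$, $r \in \underline{R}^X_{m,n}$.

Similar statements hold for $\mathrm{TL}^Y_{m,n}$, $\underline{\mathrm{TL}}^X_m$, $\underline{\mathrm{TL}}^Y_m$ and $\underline{\mathrm{TL}}_m$, relative to the corresponding
classes of rankers.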

Proof. Since every ranker can be viewed as a TL-formula, it is easily verified that if $u$ and
$v$ satisfy the same $\mathrm{TL}^X_{m,n}$-formulas, then they agree on rankers from $\underline{R}^X_{m,n}$. To prove the
converse, it suffices to show that a $\mathrm{TL}^X_{m,n}$-formula is equivalent to a Boolean combination
of formulas that are expressed by a single ranker. That is: we only need to show that
modalities can be brought outside the formula. This follows from the following elementary
logical equivalences:

$X_a(\varphi \wedge \psi) \equiv X_a \varphi \wedge X_a \psi$,

$X_a(\varphi \vee \psi) \equiv X_a \varphi \vee X_a \psi$,

$X_a(\neg \varphi) \equiv X_a \top \wedge \neg X_a \varphi$.

□ 
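
The semantics of TL and the three equivalences used in the proof can be checked mechanically on small instances. The sketch below is an illustration only: the tuple encoding of TL-formulas and the names `move` and `holds` are assumptions, and the exhaustive check covers short words over a two-letter alphabet, not the general statement.

```python
from itertools import product
from typing import Optional

def move(u: str, i: int, op: str, a: str) -> Optional[int]:
    """X_a(u, i) if op == "X", Y_a(u, i) if op == "Y"; None when undefined."""
    rng = range(i + 1, len(u) + 1) if op == "X" else range(i - 1, 0, -1)
    for j in rng:
        if u[j - 1] == a:
            return j
    return None

# Assumed encoding of TL-formulas:
#   ("T",), ("not", f), ("and", f, g), ("or", f, g), ("X", a, f), ("Y", a, f)
def holds(u: str, i: int, phi: tuple) -> bool:
    """Decide (u, i) |= phi for a position 0 <= i <= |u| + 1."""
    kind = phi[0]
    if kind == "T":
        return True
    if kind == "not":
        return not holds(u, i, phi[1])
    if kind == "and":
        return holds(u, i, phi[1]) and holds(u, i, phi[2])
    if kind == "or":
        return holds(u, i, phi[1]) or holds(u, i, phi[2])
    j = move(u, i, kind, phi[1])        # modal operator X_a or Y_a
    return j is not None and holds(u, j, phi[2])

T = ("T",)
samples = [T, ("X", "b", T), ("Y", "a", T), ("not", ("Y", "b", T)),
           ("X", "a", ("Y", "b", T))]
words = ["".join(t) for n in range(0, 4) for t in product("ab", repeat=n)]

def equivalences_hold() -> bool:
    for u, f, g in product(words, samples, samples):
        for i in range(0, len(u) + 2):
            xa = lambda h: holds(u, i, ("X", "a", h))
            if xa(("and", f, g)) != (xa(f) and xa(g)):
                return False
            if xa(("or", f, g)) != (xa(f) or xa(g)):
                return False
            if xa(("not", f)) != (xa(T) and not xa(f)):
                return False
    return True

print(equivalences_hold())
```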

Remark 1.7. Together with Example 1.4, this proposition confirms the elementary
observation that a language is $\underline{\mathrm{TL}}_1$ (resp. $\underline{\mathrm{TL}}^X_1$, $\underline{\mathrm{TL}}^Y_1$) definable if and only if it is piecewise
testable (see Remark 1.1). It follows that $\underline{\mathrm{TL}}_1$-definability is decidable.

1.4. Rankers and $\mathrm{FO}^2$. The connection established by Weis and Immerman [37, Theorem
4.5] between rankers and formulas in $\mathrm{FO}^2_{m,n}$, Theorem 1.8 below, is much deeper. If $x, y$
are integers, we let $\mathrm{ord}(x, y)$, the order type of $x$ and $y$, be one of the symbols $<$, $>$ or $=$,
depending on whether $x < y$, $x > y$ or $x = y$.

Theorem 1.8 (Weis and Immerman [37]). Let $u, v \in A^*$ and let $1 \le m \le n$. Then $u$ and
$v$ satisfy the same formulas in $\mathrm{FO}^2_{m,n}$ if and only if
(WI 1) $u$ and $v$ agree on rankers from $\underline{R}_{m,n}$,
(WI 2) if the rankers $r \in \underline{R}_{m,n}$ and $r' \in \underline{R}_{m-1,n-1}$ are defined on $u$ and $v$, then
$\mathrm{ord}(r(u), r'(u)) = \mathrm{ord}(r(v), r'(v))$,
(WI 3) if $r \in \underline{R}_{m,n}$ and $r' \in \underline{R}_{m,n-1}$ are defined on $u$ and $v$ and end with different
direction letters, then $\mathrm{ord}(r(u), r'(u)) = \mathrm{ord}(r(v), r'(v))$.

Corollary 1.9. For each $n \ge m \ge 1$, $\mathcal{TL}_{m,n} \subseteq \mathcal{FO}^2_{m,n}$ and $\underline{\mathcal{TL}}_m \subseteq \mathcal{FO}^2_m$.

Proof. Let $L$ be a $\mathrm{TL}_{m,n}$-definable language. For each $u \in L$, let $\varphi_u$ be the conjunction of
the $\mathrm{FO}^2_{m,n}$-sentences satisfied by $u$ and let $\varphi$ be the disjunction of the formulas $\varphi_u$ ($u \in L$).
Since $\mathrm{FO}^2_{m,n}$ is finite (up to logical equivalence), the conjunctions and disjunctions in the
definition of $\varphi$ are all finite. We show that $L = L(\varphi)$.

A word $v$ satisfies $\varphi$ if and only if it satisfies $\varphi_u$ for some word $u \in L$. Then $v$ satisfies
the same $\mathrm{FO}^2_{m,n}$-sentences as $u$ and, by comparing the statements in Proposition 1.6 and
Theorem 1.8, we see that $u$ and $v$ satisfy the same $\mathrm{TL}_{m,n}$-formulas. Since $L$ is defined
by such a formula, it follows that $v \in L$. Conversely, every word $v \in L$ satisfies $\varphi$ since
it satisfies $\varphi_v$, which is logically equivalent to a term in the disjunction defining $\varphi$. This
concludes the proof. □



2. On varieties and pseudovarieties 

Recent results show that the $\mathrm{FO}^2_m$-definability of a language $L$ can be characterized
algebraically, that is, in terms depending only on the syntactic monoid of $L$. This justifies
exploring the algebraic path to tackle the decidability of this definability problem. Eilenberg's
theory of varieties provides the mathematical framework. In this section, we summarize the 
information on monoid and variety theory that will be relevant for our purpose. For more 
detailed information and proofs, we refer the reader to [Sni El EH ESI EO], among other 
sources. 

A semigroup is a set equipped with a binary associative operation. A monoid is a 
semigroup which contains a unit element. The set A* of all words on alphabet A, equipped 
with the concatenation product, is the free monoid on $A$: it has the specific property
that, if $\varphi\colon A \to M$ is a map into a monoid, then there exists a unique monoid morphism
$\psi\colon A^* \to M$ which extends $\varphi$. Apart from free monoids, the semigroups and monoids which
we will consider in this paper are finite.

If $A$ is a finite alphabet and $M$ is a finite monoid, we say that a language $L \subseteq A^*$ is
recognized by $M$ if there exists a morphism $\varphi\colon A^* \to M$ such that $L = \varphi^{-1}(\varphi(L))$.

Example 2.1. If $u \in A^*$ and $B \subseteq A$, let

$\mathrm{alph}(u) = \{a \in A \mid u = vaw \text{ for some } v, w \in A^*\}$,

$[B] = \{u \in A^* \mid \mathrm{alph}(u) = B\}$.

Let $\varphi$ be the following morphism from $A^*$ into the direct product of $|A|$ copies of the
2-element monoid $\{1, 0\}$ (multiplicative): for each letter $a \in A$, $\varphi(a)$ is the $A$-tuple in
which every component is 1, except for the $a$-component, which is 0. It is elementary to show that
$[B] = \varphi^{-1}(\varphi([B]))$ and hence, $[B]$ is accepted by a monoid that is idempotent (every
element is equal to its own square) and commutative. Conversely, one can show that every
language recognized by an idempotent and commutative monoid is a Boolean combination
of languages of the form $[B]$ ($B \subseteq A$).
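
The morphism of Example 2.1 is easy to experiment with. In the following sketch the alphabet $\{a, b, c\}$ and the function names are assumptions made for illustration; it checks on short words that two words have the same image exactly when they have the same alphabet, which is what $[B] = \varphi^{-1}(\varphi([B]))$ expresses.

```python
from itertools import product

A = "abc"  # assumed three-letter alphabet for this illustration

def phi_letter(a: str) -> tuple:
    """Image of a letter: every component is 1, except the a-component, which is 0."""
    return tuple(0 if b == a else 1 for b in A)

def mult(s: tuple, t: tuple) -> tuple:
    """Product in the direct product of |A| copies of the monoid {1, 0}."""
    return tuple(x * y for x, y in zip(s, t))

def phi(u: str) -> tuple:
    """Image of a word under the morphism extending phi_letter."""
    image = (1,) * len(A)          # unit element
    for a in u:
        image = mult(image, phi_letter(a))
    return image

words = ["".join(t) for n in range(0, 5) for t in product(A, repeat=n)]

# Two words have the same image exactly when they have the same alphabet.
same_image_iff_same_alph = all(
    (phi(u) == phi(v)) == (set(u) == set(v)) for u in words for v in words
)
# The image monoid is idempotent and commutative.
images = {phi(u) for u in words}
idempotent = all(mult(s, s) == s for s in images)
commutative = all(mult(s, t) == mult(t, s) for s in images for t in images)
print(same_image_iff_same_alph, idempotent, commutative)
```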

A pseudovariety of monoids is a class of finite monoids which is closed under taking
direct products, homomorphic images and submonoids. A class of languages $\mathcal{V}$ is a collection
$\mathcal{V} = (\mathcal{V}(A))_A$, indexed by all finite alphabets $A$, such that $\mathcal{V}(A)$ is a set of languages in $A^*$.
If $\mathbf{V}$ is a pseudovariety of monoids, we let $\mathcal{V}(A)$ be the set of all languages of $A^*$ which are
recognized by a monoid in $\mathbf{V}$. The class $\mathcal{V}$ has important closure properties: each $\mathcal{V}(A)$
is closed under Boolean operations and under taking residuals (if $L \in \mathcal{V}(A)$ and $u \in A^*$,
then $Lu^{-1}$ and $u^{-1}L$ are in $\mathcal{V}(A)$); and if $\varphi\colon A^* \to B^*$ is a morphism and $L \in \mathcal{V}(B)$,
then $\varphi^{-1}(L) \in \mathcal{V}(A)$. Classes of recognizable languages with these properties are called
varieties of languages, and Eilenberg's theorem (see [20]) states that the correspondence
$\mathbf{V} \mapsto \mathcal{V}$, from pseudovarieties of monoids to varieties of languages, is one-to-one and onto.
Moreover, the decidability of membership in the pseudovariety $\mathbf{V}$ implies the decidability of
the variety $\mathcal{V}$: indeed, a language is in $\mathcal{V}$ if and only if its (effectively computable) syntactic
monoid is in $\mathbf{V}$.

For every finite semigroup $S$, there exists an integer, usually denoted $\omega$, such that every
element of the form $s^\omega$ in $S$ is idempotent. The Green relations are another important
concept to describe semigroups and monoids: if $S$ is a semigroup and $s, t \in S$, we say that
$s \le_{\mathcal{J}} t$ (resp. $s \le_{\mathcal{R}} t$, $s \le_{\mathcal{L}} t$) if $s = utv$ (resp. $s = tv$, $s = ut$) for some $u, v \in S \cup \{1\}$. We
also say that $s \mathrel{\mathcal{J}} t$ if $s \le_{\mathcal{J}} t$ and $t \le_{\mathcal{J}} s$. The relations $\mathcal{R}$ and $\mathcal{L}$ are defined similarly.

Pseudovarieties that will be important in this paper are the following. 

- $\mathbf{J_1}$, the pseudovariety of idempotent and commutative monoids; as discussed in
Example 2.1, the corresponding variety of languages consists of the Boolean combinations of
languages of the form $[B]$.

- $\mathbf{R}$, $\mathbf{L}$ and $\mathbf{J}$, the pseudovarieties of $\mathcal{R}$-, $\mathcal{L}$- and $\mathcal{J}$-trivial monoids; a monoid is, say,
$\mathcal{R}$-trivial if each of its $\mathcal{R}$-classes is a singleton. The variety of languages corresponding to
$\mathbf{J}$ was described by Simon (see [20]): it is exactly the class of piecewise testable languages,
i.e., the class of $\mathrm{FO}^2_1$-definable languages, see Remarks 1.1 and 1.7.

- $\mathbf{A}$, the pseudovariety of aperiodic monoids, i.e., monoids in which $x^\omega = x^{\omega+1}$ holds for each
$x$. Celebrated theorems of Schützenberger, McNaughton and Papert and Kamp show that
the corresponding variety of languages consists of the star-free languages, the languages 
that are definable in FO, and the languages definable in propositional temporal logic, see 
for instance [20l |32l El |29l [30] . 

- $\mathbf{DA}$ is the pseudovariety of all monoids in which $(xy)^\omega x (xy)^\omega = (xy)^\omega$ for all $x, y$. This
pseudovariety has many characterizations in combinatorial, algebraic and logical terms. Of
particular interest to us is the fact that the corresponding variety of languages consists of
the languages that are definable in $\mathrm{FO}^2$, and equivalently, of the languages that are defined
in unary temporal logic, see [3il |32l El HSj [35] among others.

Figure 1: The positions defined by $r$ in $u$, when $r$ is condensed on $u$.

- Straubing showed that, for each $m \ge 1$, $\mathcal{FO}^2_m$ is a variety of languages, and he
described the corresponding pseudovariety of monoids, which we write $\mathbf{FO}^2_m$, in terms of
iterated block products [28]. We will not need to discuss the definition of the block product
here, retaining only that this characterization does not imply decidability, and that Straubing
gave identities (using products and $\omega$-powers like the identities given above for $\mathbf{A}$ and
$\mathbf{DA}$) which he conjectures define each $\mathbf{FO}^2_m$. Establishing this conjecture would prove the
decidability of $\mathrm{FO}^2_m$-definability.

- Kufleitner and Lauser also showed that, for each $n \ge m \ge 1$, $\mathcal{FO}^2_{m,n}$ and $\mathcal{FO}^2_m$ form
varieties of languages, using a general result on logical fragments [16, Cor. 3.4]. Their
result also does not imply a decidability statement.

- On a given monoid $M$, we define the congruences $\sim_K$ and $\sim_D$ as follows.

• $u \sim_K v$ if and only if, for each idempotent $e$ in $M$, we have either $eu, ev <_{\mathcal{J}} e$ or
$eu = ev$,

• $u \sim_D v$ if and only if, for each idempotent $e$ in $M$, we have either $ue, ve <_{\mathcal{J}} e$ or
$ue = ve$.

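If $\mathbf{V}$ is a pseudovariety of monoids, we say that the monoid $M \in \mathbf{K} \mathbin{ⓜ} \mathbf{V}$ if $M/{\sim_K} \in \mathbf{V}$,
and $M \in \mathbf{D} \mathbin{ⓜ} \mathbf{V}$ if $M/{\sim_D} \in \mathbf{V}$. The classes $\mathbf{K} \mathbin{ⓜ} \mathbf{V}$ and $\mathbf{D} \mathbin{ⓜ} \mathbf{V}$ are pseudovarieties as well,
which are usually defined in terms of Mal'cev products with the pseudovarieties $\mathbf{K}$ and $\mathbf{D}$,
see [23, Thm 4.6.50] or [I2l[l0].

The following equalities are well-known [20]:

$\mathbf{K} \mathbin{ⓜ} \mathbf{J_1} = \mathbf{K} \mathbin{ⓜ} \mathbf{J} = \mathbf{R}$, $\quad \mathbf{D} \mathbin{ⓜ} \mathbf{J_1} = \mathbf{D} \mathbin{ⓜ} \mathbf{J} = \mathbf{L}$.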
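
To make the decidability remarks of this section concrete, the identities defining $\mathbf{A}$ and $\mathbf{DA}$ can be tested on a finite monoid by exhaustive search. The sketch below is an illustration under stated assumptions: monoids are given as transformation monoids generated by a few maps (tuples), and the three test monoids (the two-element monoid $\{1, 0\}$, the two-element group, and a six-element monoid generated by two partial bijections with a sink state) are examples chosen here, not taken from the text.

```python
from itertools import product

def compose(f: tuple, g: tuple) -> tuple:
    """Product fg of two transformations: apply f first, then g."""
    return tuple(g[x] for x in f)

def generate(gens: list) -> set:
    """Transformation monoid generated by gens (identity included)."""
    identity = tuple(range(len(gens[0])))
    elems, frontier = {identity}, [identity]
    while frontier:
        f = frontier.pop()
        for g in gens:
            h = compose(f, g)
            if h not in elems:
                elems.add(h)
                frontier.append(h)
    return elems

def omega(x: tuple) -> tuple:
    """The idempotent power x^omega, found by iterating x, x^2, x^3, ..."""
    p = x
    while compose(p, p) != p:
        p = compose(p, x)
    return p

def is_aperiodic(M: set) -> bool:
    """Check x^omega = x^(omega+1) for every x."""
    return all(compose(omega(x), x) == omega(x) for x in M)

def satisfies_da_identity(M: set) -> bool:
    """Check (xy)^omega x (xy)^omega = (xy)^omega for all x, y."""
    for x, y in product(M, repeat=2):
        e = omega(compose(x, y))
        if compose(compose(e, x), e) != e:
            return False
    return True

U1 = generate([(0, 0)])                 # {1, 0}: identity and a constant map
Z2 = generate([(1, 0)])                 # two-element group
B21 = generate([(0, 2, 0), (0, 0, 1)])  # state 0 is a sink; six elements
for name, M in [("U1", U1), ("Z2", Z2), ("B21", B21)]:
    print(name, len(M), is_aperiodic(M), satisfies_da_identity(M))
```

On these inputs the first monoid satisfies both identities, the group satisfies neither, and the six-element monoid is aperiodic but fails the $\mathbf{DA}$ identity.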

3. Condensed rankers 

Our main tool to approach the decidability of $\mathrm{FO}^2_m$-definability lies in the notion of
condensed rankers, a variant of rankers which was introduced implicitly by Weis and Immerman
to prove Theorem 6.4 below (see [37, Theorem 4.7]). Recall that a ranker can be seen as a
sequence of directional instructions (see Example 1.3). We say that a ranker $r$ is condensed
on $u$ if it is defined on $u$, and if the sequence of positions visited zooms in on $r(u)$, never
crossing over a position already visited, see Figure 1. Formally, $r = Z_1 \cdots Z_n$ is condensed
on $u$ if there exists a chain of open intervals

$(0; |u| + 1) = (i_0; j_0) \supset (i_1; j_1) \supset \cdots \supset (i_{n-1}; j_{n-1}) \ni r(u)$

such that for all $1 \le \ell \le n - 1$ the following properties are satisfied:
• If $Z_\ell Z_{\ell+1} = X_a X_b$ then $(i_\ell, j_\ell) = (X_a(u, i_{\ell-1}), j_{\ell-1})$.
• If $Z_\ell Z_{\ell+1} = Y_a Y_b$ then $(i_\ell, j_\ell) = (i_{\ell-1}, Y_a(u, j_{\ell-1}))$.
• If $Z_\ell Z_{\ell+1} = X_a Y_b$ then $(i_\ell, j_\ell) = (i_{\ell-1}, X_a(u, i_{\ell-1}))$.
• If $Z_\ell Z_{\ell+1} = Y_a X_b$ then $(i_\ell, j_\ell) = (Y_a(u, j_{\ell-1}), j_{\ell-1})$.

Remark 3.1. The $i_\ell$ and $j_\ell$ are either 0 or $1 + |u|$, or positions of the form $r'(u)$ for some
prefix $r'$ of $r$. More precisely, if $r_\ell$ is the depth $\ell$ prefix of $r$ ($\ell < n$), then $r_\ell(u) = i_\ell$ if
$Z_{\ell+1}$ is of the form $X_a$, and $r_\ell(u) = j_\ell$ if $Z_{\ell+1}$ is of the form $Y_a$.

Remark 3.2. If $r = r_1 r_2$ is condensed on $u$, then $r(u) > r_1(u)$ if $r_2$ starts with an X-letter,
and $r(u) < r_1(u)$ if $r_2$ starts with a Y-letter.

Example 3.3. The ranker $X_a Y_b X_c$ is defined on the words $bac$ and $bca$, but it is condensed
only on $bca$.

Rankers in $R_{1,n}$ and rankers of the form $X_a Y_{b_1} \cdots Y_{b_k}$ or $Y_a X_{b_1} \cdots X_{b_k}$ are condensed on
all words on which they are defined.
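
The interval condition lends itself to a direct check. The following sketch assumes an operational reading of the definition in which every visited position must lie strictly inside the current open interval (so that a previously visited position is neither crossed nor revisited); the encoding of rankers and the function names are choices made for this illustration.

```python
from typing import List, Optional, Tuple

Ranker = List[Tuple[str, str]]   # e.g. [("X", "a"), ("Y", "b"), ("X", "c")]

def step(u: str, q: int, direction: str, a: str) -> Optional[int]:
    """X_a(u, q) or Y_a(u, q), with 1-based positions; None when undefined."""
    rng = range(q + 1, len(u) + 1) if direction == "X" else range(q - 1, 0, -1)
    for i in rng:
        if u[i - 1] == a:
            return i
    return None

def is_condensed(r: Ranker, u: str) -> bool:
    """Check that r is defined on u and that each move stays inside (lo, hi)."""
    lo, hi = 0, len(u) + 1                     # the interval (i_0; j_0)
    pos = lo if r[0][0] == "X" else hi
    for k, (direction, a) in enumerate(r):
        p = step(u, pos, direction, a)
        if p is None or not (lo < p < hi):
            return False
        if k + 1 < len(r):
            # The next letter decides which bound the new position replaces.
            if r[k + 1][0] == "X":
                lo = p
            else:
                hi = p
        pos = p
    return True

r = [("X", "a"), ("Y", "b"), ("X", "c")]
print(is_condensed(r, "bac"), is_condensed(r, "bca"))
```

On Example 3.3 this reports that $X_a Y_b X_c$ is not condensed on `bac` (the last move jumps over the $a$ already visited) and is condensed on `bca`.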

Condensed rankers form a natural notion, which is equally well-suited to the task of
describing $\mathrm{FO}^2_m$-definability (see Theorem 3.11 below). With respect to TL, for which
Proposition 1.6 shows a perfect match with the notion of rankers, they can be interpreted as adding
a strong notion of unambiguity, see Section 6.3 below and the work of Lodaya, Pandya and
Shah [17].

Let us say that two words $u$ and $v$ agree on condensed rankers from a set $R$ of rankers,
if the same rankers in $R$ are condensed on $u$ and $v$. We write $u \triangleright_{m,n} v$ (resp. $u \triangleleft_{m,n} v$) if $u$
and $v$ agree on condensed rankers in $\underline{R}^X_{m,n}$ (resp. $\underline{R}^Y_{m,n}$).

If $r$ is a ranker, let $L_c(r)$ be the language of all words on which $r$ is condensed. We
define $\mathcal{R}_m$ (resp. $\mathcal{L}_m$) to be the Boolean algebra generated by the languages of the form
$L_c(r)$, $r \in \underline{R}^X_{m,n}$ (resp. $\underline{R}^Y_{m,n}$), $n \ge m$.

3.1. Technical properties of condensed rankers. A factorization $u = u_- a u_+$ of a word
$u \in A^*$ is called the $a$-left factorization of $u$ if $a \notin \mathrm{alph}(u_-)$. Symmetrically, $u = u_- a u_+$ is
the $a$-right factorization of $u$ if $a \notin \mathrm{alph}(u_+)$. Thus, the $a$-left (resp. $a$-right) factorization
of $u$ identifies the first occurrence of $a$ when reading $u$ from the left (resp. the right).
Lemmas 3.4 and 3.5 admit an elementary verification.

Lemma 3.4. Let $s$ be a ranker, $a \in A$ and $r = X_a s$. Let also $u \in A^+$ and let $u = u_- a u_+$
be its $a$-left factorization. Then $r$ is condensed on $u$ if and only if

• $s$ is condensed on $u_+$ if $s$ starts with an X-block;

• $s$ is condensed on $u_-$ if $s$ starts with a Y-block.

A dual statement holds if $r$ is of the form $r = Y_a s$, with respect to the $a$-right factorization
of $u$.
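
Lemma 3.4 can be tested exhaustively on small instances. The sketch below is an illustration, not a proof: it reuses the operational reading of condensedness in which each visited position lies strictly inside the current interval, it only considers words containing the letter $a$ (so that the $a$-left factorization exists), and all function names are assumptions of this example.

```python
from itertools import product
from typing import Optional

def step(u: str, q: int, d: str, a: str) -> Optional[int]:
    """X_a(u, q) or Y_a(u, q), with 1-based positions; None when undefined."""
    rng = range(q + 1, len(u) + 1) if d == "X" else range(q - 1, 0, -1)
    return next((i for i in rng if u[i - 1] == a), None)

def is_condensed(r: list, u: str) -> bool:
    """Each visited position must lie strictly inside the current open interval."""
    lo, hi = 0, len(u) + 1
    pos = lo if r[0][0] == "X" else hi
    for k, (d, a) in enumerate(r):
        p = step(u, pos, d, a)
        if p is None or not (lo < p < hi):
            return False
        if k + 1 < len(r):
            lo, hi = (p, hi) if r[k + 1][0] == "X" else (lo, p)
        pos = p
    return True

def left_factorization(u: str, a: str):
    """Return (u_minus, u_plus) with u = u_minus a u_plus and a not in u_minus."""
    i = u.find(a)
    return None if i < 0 else (u[:i], u[i + 1:])

def lemma_3_4_holds(s: list, a: str, u: str) -> bool:
    """Compare 'X_a s condensed on u' with 's condensed on the relevant factor'."""
    u_minus, u_plus = left_factorization(u, a)
    target = u_plus if s[0][0] == "X" else u_minus
    return is_condensed([("X", a)] + s, u) == is_condensed(s, target)

letters = [(d, c) for d in "XY" for c in "ab"]
rankers = [list(t) for k in range(1, 4) for t in product(letters, repeat=k)]
words = ["".join(t) for n in range(1, 5) for t in product("ab", repeat=n)]
all_ok = all(lemma_3_4_holds(s, "a", u) for u in words if "a" in u for s in rankers)
print(all_ok)
```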

Lemma 3.5. Let $r$ be a ranker and $a \in A$. Let also $u \in A^+$ and let $u = u_- a u_+$ be its $a$-left
factorization.

If $r$ starts with an X-letter, then

• $r$ is defined on $u_-$ if and only if $r$ is defined on $u$, $r$ does not contain $X_a$ or $Y_a$ and, for
every prefix $p$ of $r$ ending with an X-letter, $pY_a$ is not defined on $u$.

• $r$ is condensed on $u_-$

- if and only if $r$ is defined on $u_-$ and condensed on $u$,

- if and only if $r$ is condensed on $u$, $r$ does not contain $X_a$ or $Y_a$ and, if $p$ is the maximal
prefix of $r$ consisting only of X-letters, then $pY_a$ is not defined on $u$,

- if and only if $r$ is condensed on $u$, $r$ does not contain $X_a$ or $Y_a$ and, if $p = X_{b_1} \cdots X_{b_k}$
($k \ge 1$) is the initial X-block of $r$, then $X_a Y_{b_k} \cdots Y_{b_1}$ is defined on $u$.

• $r$ is defined on $u_+$ if and only if $X_a r$ is defined on $u$ and, for every prefix $p$ of $r$ ending
with a Y-letter, $X_a p Y_a$ is defined on $u$.

• $r$ is condensed on $u_+$ if and only if $X_a r$ is condensed on $u$.

If $r$ starts with a Y-letter, then

• $r$ is defined on $u_-$ if and only if $X_a r$ is defined on $u$, $r$ does not contain $X_a$ or $Y_a$ and,
for every prefix $p$ of $r$ ending with an X-letter, $X_a p Y_a$ is not defined on $u$.

• $r$ is condensed on $u_-$ if and only if $X_a r$ is condensed on $u$.

• $r$ is defined on $u_+$ if and only if $r$ is defined on $u$ and, for every prefix $p$ of $r$ ending with
a Y-letter, $pY_a$ is defined on $u$.

• $r$ is condensed on $u_+$

- if and only if $r$ is defined on $u_+$ and condensed on $u$,

- if and only if $r$ is condensed on $u$ and, if $p = Y_{b_1} \cdots Y_{b_k}$ ($k \ge 1$) is the initial Y-block
of $r$, then $pY_a$ is defined on $u$.

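We also note the following, very useful characterization of the relations $\triangleright_{m,n}$ and $\triangleleft_{m,n}$.

Proposition 3.6. The families of relations $\triangleright_{m,n}$ and $\triangleleft_{m,n}$ ($n \ge m \ge 1$) are uniquely
determined by the following properties.

(1) $u \triangleright_{1,n} v$ if and only if $u \triangleleft_{1,n} v$, if and only if $u$ and $v$ have the same subwords of length
at most $n$.

(2) If $m \ge 2$, then $u \triangleright_{m,n} v$ if and only if $\mathrm{alph}(u) = \mathrm{alph}(v)$, $u \triangleleft_{m-1,n-1} v$ and for each letter
$a \in \mathrm{alph}(u)$, the $a$-left factorizations $u = u_- a u_+$ and $v = v_- a v_+$ satisfy $u_- \triangleleft_{m-1,n-1} v_-$
and $u_+ \triangleright_{m,n-1} v_+$ ($u_+ \triangleright_{m-1,n-1} v_+$ if $n = m$).

(3) If $m \ge 2$, then $u \triangleleft_{m,n} v$ if and only if $\mathrm{alph}(u) = \mathrm{alph}(v)$, $u \triangleright_{m-1,n-1} v$ and for
each letter $a \in \mathrm{alph}(u)$, the $a$-right factorizations $u = u_- a u_+$ and $v = v_- a v_+$ satisfy
$u_+ \triangleright_{m-1,n-1} v_+$ and $u_- \triangleleft_{m,n-1} v_-$ ($u_- \triangleleft_{m-1,n-1} v_-$ if $n = m$).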
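
Statement (1) gives a directly computable test for the base level of the two families. A minimal sketch, with function names chosen for this illustration and restricted to non-empty subwords (matching the convention that rankers are non-empty):

```python
from itertools import combinations

def subwords(u: str, n: int) -> set:
    """All non-empty scattered subwords of u of length at most n."""
    return {
        "".join(u[i] for i in idx)
        for k in range(1, n + 1)
        for idx in combinations(range(len(u)), k)
    }

def agree_level_one(u: str, v: str, n: int) -> bool:
    """Test of statement (1): same subwords of length at most n."""
    return subwords(u, n) == subwords(v, n)

print(agree_level_one("abab", "abba", 2))  # True
print(agree_level_one("abab", "abba", 3))  # False: "bab" is a subword of abab only
```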

Proof. Statement (1) follows directly from Examples 1.4 and 3.3. Let us now assume that
$m \ge 2$.

Suppose that $\mathrm{alph}(u) = \mathrm{alph}(v)$, $u \triangleleft_{m-1,n-1} v$ and for each $a \in \mathrm{alph}(u)$, the $a$-left
factorizations $u = u_- a u_+$ and $v = v_- a v_+$ satisfy $u_- \triangleleft_{m-1,n-1} v_-$ and $u_+ \triangleright_{m,n-1} v_+$ if
$n > m$ ($u_+ \triangleright_{m-1,n-1} v_+$ if $n = m$). Let $r \in \underline{R}^X_{m,n}$ be condensed on $u$. If $r$ starts with a
Y-letter, then $r \in \underline{R}^Y_{m-1,n-1}$, and hence $r$ is condensed on $v$ since $u \triangleleft_{m-1,n-1} v$. If instead
$r$ starts with an X-letter, say $r = X_a s$, we consider the $a$-left factorizations of $u$ and $v$. If $s$
starts with a Y-letter, then $s \in \underline{R}^Y_{m-1,n-1}$, $s$ is condensed on $u_-$ (Lemma 3.4) and hence $s$ is
condensed on $v_-$ since $u_- \triangleleft_{m-1,n-1} v_-$, from which it follows again that $r$ is condensed on
$v$. Finally, if $s$ starts with an X-letter, then $s$ is condensed on $u_+$ by Lemma 3.4. Moreover,
$s \in \underline{R}^X_{m,n-1}$ if $n > m$. If $n = m$, we have in fact $r \in \underline{R}^X_{m-1,n}$ (since $r$ starts with two
X-letters) and hence $s \in \underline{R}^X_{m-1,n-1}$. Since $u_+ \triangleright_{m,n-1} v_+$ if $n > m$ and $u_+ \triangleright_{m-1,n-1} v_+$ if
$n = m$, it follows that $s$ is condensed on $v_+$, and hence $r$ is condensed on $v$.

Conversely, let us assume that $u \triangleright_{m,n} v$, that is, $u$ and $v$ agree on condensed rankers
in $\underline{R}^X_{m,n}$. Considering rankers in $R^X_{1,1} \subseteq \underline{R}^X_{m,n}$ shows that $\mathrm{alph}(u) = \mathrm{alph}(v)$. Similarly,
considering rankers in $\underline{R}^Y_{m-1,n-1} \subseteq \underline{R}^X_{m,n}$ shows that $u \triangleleft_{m-1,n-1} v$. Finally, let $a \in \mathrm{alph}(u)$
and let $u = u_- a u_+$ and $v = v_- a v_+$ be $a$-left factorizations.

Let $s \in \underline{R}^Y_{m-1,n-1}$ be condensed on $u_-$. Note that $s$ contains neither $X_a$ nor $Y_a$, since
$a \notin \mathrm{alph}(u_-)$. If $s$ starts with a Y-letter, then $r = X_a s$ is condensed on $u$ (Lemma 3.4) and
since $r \in \underline{R}^X_{m,n}$, $r$ is condensed on $v$ as well, which implies that $s$ is condensed on $v_-$. If
instead $s$ starts with an X-letter, then $s$ is condensed on $u$ and hence on $v$. Moreover, if
$p = X_{b_1} \cdots X_{b_k}$ is the maximal prefix of $s$ consisting only of X-letters, then $X_a Y_{b_k} \cdots Y_{b_1} \in
\underline{R}^X_{2,n}$ is condensed on $u$ (Lemma 3.5). Since $\underline{R}^X_{2,n} \subseteq \underline{R}^X_{m,n}$, it is condensed on $v$ as well and
hence, $s$ is condensed on $v_-$.

Finally, assume that $s \in \underline{R}^X_{m,n-1}$ ($\underline{R}^X_{m-1,n-1}$ if $n = m$) is condensed on $u_+$. The
reasoning is similar: if $s$ starts with an X-letter, then $X_a s \in \underline{R}^X_{m,n}$ is condensed on $u$.
Therefore $X_a s$ is condensed on $v$ and $s$ is condensed on $v_+$. If instead $s$ starts with a
Y-letter, then $s$ is condensed on $u$ and $s \in \underline{R}^Y_{m-1,n-2}$ ($\underline{R}^Y_{m-2,n-2}$ if $n = m$). In particular
$s \in \underline{R}^X_{m,n}$ and hence, $s$ is condensed on $v$ as well. Moreover, if $p$ is the initial Y-block of $s$,
then $pY_a$ is condensed on $u$. Note that $pY_a \in \underline{R}^Y_{1,n-1} \subseteq \underline{R}^X_{m,n}$, so $pY_a$ is condensed on $v$ and
$s$ is condensed on $v_+$. □

Lemma 3.7. Let n ≥ m ≥ 2, $u, v \in A^*$, $a \in A$ and let $u = u_- a u_+$ and $v = v_- a v_+$ be a-left
factorizations. If $u \triangleright_{m,n} v$, then $u_- \triangleright_{m,n-1} v_-$ ($u_- \triangleright_{m-1,n-1} v_-$ if n = m). And if $u \triangleleft_{m,n} v$,
then $u_+ \triangleleft_{m,n-1} v_+$ ($u_+ \triangleleft_{m-1,n-1} v_+$ if n = m). Dual statements hold for the factors of the
a-right factorizations of u and v if $u \triangleleft_{m,n} v$ or $u \triangleright_{m,n} v$.
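
The a-left and a-right factorizations used throughout this section can be sketched in a few lines of Python. This is only an illustration: words are modeled as Python strings, letters as single characters, and the function names `left_factorization` and `right_factorization` are hypothetical, not taken from the paper.

```python
from typing import Optional, Tuple


def left_factorization(u: str, a: str) -> Optional[Tuple[str, str]]:
    """Return (u_minus, u_plus) with u == u_minus + a + u_plus and a not in u_minus.

    Returns None when the letter a does not occur in u.
    """
    i = u.find(a)  # leftmost occurrence of a
    if i < 0:
        return None
    return u[:i], u[i + 1:]


def right_factorization(u: str, a: str) -> Optional[Tuple[str, str]]:
    """Return (u_minus, u_plus) with u == u_minus + a + u_plus and a not in u_plus."""
    i = u.rfind(a)  # rightmost occurrence of a
    if i < 0:
        return None
    return u[:i], u[i + 1:]
```

For instance, the a-left factorization of `bcabca` splits it at the first `a`, while the a-right factorization splits it at the last one.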

Proof. We give the proof if n > m; it is easily adapted to the case where n = m. 

Assume that u >m,n v and r G i?mn-i is condensed on u-. By Lemma [331 we have: 

- If r starts with an X-letter, then r is condensed on u, r does not contain occurrences
of $X_a$ or $Y_a$, and if $p = X_{b_1} \cdots X_{b_k}$ is the initial X-block of r, then $q = X_a Y_{b_k} \cdots Y_{b_1}$ is
condensed on u. Since $q \in \underline{R}^X_{2,k+1}$ with k < n, we have $q \in \underline{R}^X_{m,n}$ and hence r and q are
condensed on v. Therefore r is condensed on $v_-$.

- If r starts with a Y-letter, then $X_a r$ is condensed on u. But $r \in \underline{R}^Y_{m-1,n-2}$, so
$X_a r \in \underline{R}^X_{m,n}$ and hence $X_a r$ is condensed on v. It follows that r is condensed on $v_-$.

Assume now that $u \triangleleft_{m,n} v$ and $r \in \underline{R}^Y_{m,n-1}$ is condensed on $u_+$. Then

- If r starts with an X-letter (which is possible only if m > 2), then $r \in \underline{R}^X_{m-1,n-2}$ and
$X_a r$ is condensed on u. But $X_a r \in \underline{R}^X_{m-1,n-1} \subseteq \underline{R}^Y_{m,n}$, so $X_a r$ is condensed on v and r is
condensed on $v_+$.

- If instead r starts with a Y-letter, then r is condensed on u and if p is the initial
Y-block of r, then $p Y_a$ is condensed on u. But $r, p Y_a \in \underline{R}^Y_{m,n}$, so r and $p Y_a$ are condensed
on v, and r is condensed on $v_+$. □

3.2. Condensed rankers, rankers and $\mathcal{FO}^2_m$. We now show that, in the characterization
of $\mathcal{FO}^2_{m,n}$ in Theorem 1.8, condensed rankers can be used just as well. This is done in
Theorem 3.11. The first step is to relate agreement on rankers and agreement on condensed
rankers. We start with a technical lemma.

Lemma 3.8. If a ranker $r \in \underline{R}^Z_{m,n}$ ($Z \in \{X, Y\}$) is defined but not condensed on u, and if
s is the maximal prefix of r which is condensed on u, then one of the following holds, for
some ℓ ≥ 1:

• $r = s X_b t$, $s = s_0 Y_a X_{b_1} \cdots X_{b_{\ell-1}}$ and $s_0 Y_a(u) < s(u) < s_0(u) \le s X_b(u)$;

• $r = s Y_b t$, $s = s_0 X_a Y_{b_1} \cdots Y_{b_{\ell-1}}$ and $s_0 X_a(u) > s(u) > s_0(u) \ge s Y_b(u)$.

Moreover $s_0$ is not empty, $s_0 \in \underline{R}_{m-1,n-\ell}$; $s X_b(u) = s_0(u)$ (resp. $s Y_b(u) = s_0(u)$) if the last
letter of $s_0$ is in $\{X_b, Y_b\}$; and $s X_b(u) = s_0 X_b(u)$ (resp. $s Y_b(u) = s_0 Y_b(u)$) otherwise.

Proof. Rankers in $\underline{R}_{1,n}$ are condensed on each word on which they are defined (Example 3.3).
Therefore we have m ≥ 2.

By hypothesis, s ≠ r. We consider the case where the first letter after s is an X-letter,
the other case is dual. Then r is of the form $r = s X_b t$, where t may be empty. In view of
Example 3.3, $s = s_0 Y_a X_{b_1} \cdots X_{b_{\ell-1}}$ for some non-empty $s_0$ and ℓ ≥ 1. Since s is condensed
on u but $s X_b$ is not, we have the following (see Remark 3.2):

$$s_0 Y_a(u) < s_0 Y_a X_{b_1}(u) < \cdots < s_0 Y_a X_{b_1} \cdots X_{b_{\ell-1}}(u) = s(u) < s_0(u),$$

and $s X_b(u) \ge s_0(u)$. More precisely, $s X_b(u)$ is the first b-position to the right of s(u), so
$s X_b(u) = s_0(u)$ if $s_0(u)$ is a b-position (i.e., if $s_0$ ends with $X_b$ or $Y_b$), and $s X_b(u) = s_0 X_b(u)$
otherwise. □

Proposition 3.9. Let n ≥ m ≥ 1, $u, v \in A^*$ and $Z \in \{X, Y\}$. If u and v agree on condensed
rankers in $\underline{R}^Z_{m,n}$ and if $r \in \underline{R}^Z_{m,n}$ is defined on both u and v, then there exists $r' \in \underline{R}^Z_{m,n}$
which is condensed on u and v and coincides with r on both words.

Proof. The result is trivial if m = 1, since rankers in $\underline{R}_{1,n}$ are condensed on each word on
which they are defined (Example 3.3). We now assume that m ≥ 2.

Let p and q be positions in u and v and let $r \in \underline{R}^Z_{m,n}$ be such that r(u) = p and r(v) = q.
If r is not condensed on u, then r is not condensed on v (since the two words agree on
condensed rankers). With the notation of Lemma 3.8, r coincides on both u and v with
$r' = s_0 t$, $s_0 X_b t$ or $s_0 Y_b t$ (depending on the last letter of $s_0$ and on the letter following s in
r), which starts with the same letter as r. If r' is not condensed on u and v, we repeat the
reasoning. This process must terminate since each iteration reduces the depth of r'. □

Proposition 3.10. Let n ≥ m ≥ 1, $u, v \in A^*$ and $Z \in \{X, Y\}$. If u and v agree on
condensed rankers in $\underline{R}^Z_{m,n}$, then they agree on rankers from the same class.

Proof. If u and v do not agree on rankers from $\underline{R}^Z_{m,n}$, let $r \in \underline{R}^Z_{m,n}$ be a minimum depth
ranker on which u and v disagree. Without loss of generality, we may assume that $u \in L(r)$
and $v \notin L(r)$. In particular, r is not condensed on u.

Let s, $s_0$ and t be as in Lemma 3.8. Without loss of generality again, we may assume
that the letter following s in r is $X_b$. Since s is condensed on u and $s X_b$ is not, the ranker
s is condensed on v and $s X_b$ is not. Moreover, $s X_b$ coincides on u with $s' = s_0$, or $s_0 X_b$,
depending on the last letter of $s_0$. Observe that s' is shorter than r, so s' is defined on v.
In particular, there exists a b-position in v to the right of s, which is not to the left of $s_0$
(since $s X_b$ is not condensed on v). It follows that $s X_b(v) = s'(v)$. Let now r' = s't: then r'
is shorter than r, it coincides with r on u, and it is not defined on v since s' coincides with
$s X_b$ on that word. This contradicts the minimality of r. □

We can now prove the following variant of Theorem 1.8.

Theorem 3.11. Let $u, v \in A^*$ and let 1 ≤ m ≤ n. Then u and v satisfy the same formulas
in $\mathcal{FO}^2_{m,n}$ if and only if

(WI 1c) u and v agree on condensed rankers from $\underline{R}_{m,n}$,

(WI 2c) if the rankers $r \in \underline{R}_{m,n}$ and $r' \in \underline{R}_{m-1,n-1}$ are condensed on u and v, then
$\mathrm{ord}(r(u), r'(u)) = \mathrm{ord}(r(v), r'(v))$.

(WI 3c) if $r \in \underline{R}_{m,n}$ and $r' \in \underline{R}_{m,n-1}$ are condensed on u and v and end with different
direction letters, then $\mathrm{ord}(r(u), r'(u)) = \mathrm{ord}(r(v), r'(v))$.

Proof. We need to prove that together, Properties (WI 1), (WI 2) and (WI 3) are equivalent
to Properties (WI 1c), (WI 2c) and (WI 3c).

Let us first assume that (WI 1), (WI 2) and (WI 3) hold. It is immediate that (WI
2c) and (WI 3c) hold. If (WI 1c) does not hold, let r be a ranker in $\underline{R}_{m,n}$ which is
condensed on v and not on u. Since (WI 1) holds, r is defined on u. Let $s_0$, s and t be as
in Lemma 3.8 and let us assume, without loss of generality, that the letter following s in r
is $X_b$. Then $s_0$ and $s X_b$ are defined on both u and v, with $s_0 \in \underline{R}_{m-1,n-1}$ and $s X_b \in \underline{R}_{m,n}$.
Since r is condensed on v, we have $s X_b(v) < s_0(v)$, and since $s X_b$ is not condensed on u, we
have $s_0(u) \le s X_b(u)$, contradicting Property (WI 2). Thus (WI 1c) holds.

Conversely, let us assume that (WI 1c), (WI 2c) and (WI 3c) hold. Then (WI 1)
holds by Proposition 3.10. Let us verify Property (WI 2): suppose that $r \in \underline{R}_{m,n}$ and
$r' \in \underline{R}_{m-1,n-1}$ are defined on u and v. In view of (WI 1c), Proposition 3.9 shows that
there exist rankers $s \in \underline{R}_{m,n}$ and $s' \in \underline{R}_{m-1,n-1}$ which are condensed on u and v, and which
coincide with r and r', respectively, on both words. By (WI 2c), we have $\mathrm{ord}(s(u), s'(u)) =
\mathrm{ord}(s(v), s'(v))$, and hence $\mathrm{ord}(r(u), r'(u)) = \mathrm{ord}(r(v), r'(v))$. Thus Property (WI 2) holds.
The verification of (WI 3) is identical. □

These results imply the following statement, which refines Corollary 1.9 and can be
proved like that Corollary, using Propositions 1.6 and 3.10 and Theorem 3.11.

Corollary 3.12. For each m ≥ 1, we have $\mathcal{TL}^X_m \subseteq \mathcal{R}_m \subseteq \mathcal{FO}^2_m$ and $\mathcal{TL}^Y_m \subseteq \mathcal{L}_m \subseteq \mathcal{FO}^2_m$.



3.3. Condensed rankers determine a hierarchy of varieties. We now examine the
algebraic properties of the relations $\triangleright_{m,n}$ and $\triangleleft_{m,n}$.

Lemma 3.13. The relations $\triangleright_{m,n}$ and $\triangleleft_{m,n}$ are finite-index congruences.

Proof. The relations $\triangleright_{m,n}$ and $\triangleleft_{m,n}$ are clearly equivalence relations, of finite index since
$\underline{R}_{m,n}$ is finite. We now verify that if $b \in A$ and if u and v are $\triangleright_{m,n}$-equivalent, then so are
ub and vb (resp. bu and bv).

The proof is by induction on m + n. The property of having the same subwords of length
n is easily seen to be a congruence (and the proof of this fact can be found in [20] as it is
related to Simon's theorem on piecewise testable languages). In view of Proposition 3.6 (1),
this shows that $\triangleright_{1,n}$ and $\triangleleft_{1,n}$ are congruences.
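
The base case can be explored experimentally. The following sketch, under the assumption that "subword" means scattered subword and that words are Python strings, compares the sets of subwords of length at most n; the helper names `subwords` and `same_subwords` are hypothetical. It enumerates index subsets by brute force, so it is only meant for short words.

```python
from itertools import combinations


def subwords(u: str, n: int) -> set:
    """All scattered subwords of u of length at most n (brute force)."""
    return {
        "".join(u[i] for i in idx)
        for k in range(n + 1)
        for idx in combinations(range(len(u)), k)
    }


def same_subwords(u: str, v: str, n: int) -> bool:
    """True if u and v have the same subwords of length at most n."""
    return subwords(u, n) == subwords(v, n)
```

For example, `abab` and `abba` have the same subwords of length at most 2 but not of length at most 3, and the equivalence at level 2 is preserved when the same letter is appended or prepended to both words, as the congruence property predicts.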

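Let us now assume that n ≥ m ≥ 2 and $u \triangleright_{m,n} v$. By Proposition 3.6 (2), we have
$\mathrm{alph}(u) = \mathrm{alph}(v)$ and $u \triangleleft_{m-1,n-1} v$. It follows that $\mathrm{alph}(ub) = \mathrm{alph}(bu) = \mathrm{alph}(vb) =
\mathrm{alph}(bv)$, and that $ub \triangleleft_{m-1,n-1} vb$ and $bu \triangleleft_{m-1,n-1} bv$ by induction.

Let now $a \in \mathrm{alph}(u) \cup \{b\}$. If $a \in \mathrm{alph}(u)$ and if $u = u_- a u_+$ and $v = v_- a v_+$ are a-left
factorizations, then the a-left factorizations of ub and vb are $u_- \, a \, (u_+ b)$ and $v_- \, a \, (v_+ b)$.
And the a-left factorizations of bu and bv are $(b u_-) \, a \, u_+$ and $(b v_-) \, a \, v_+$, unless a = b,
in which case these factorizations are $\varepsilon \, b \, u$ and $\varepsilon \, b \, v$. By Proposition 3.6 (2) we have
$u_- \triangleleft_{m-1,n-1} v_-$ and $u_+ \triangleright_{m,n-1} v_+$ ($u_+ \triangleright_{m-1,n-1} v_+$ if n = m). By induction, we have
$u \triangleright_{m,n-1} v$, $b u_- \triangleleft_{m-1,n-1} b v_-$ and $u_+ b \triangleright_{m,n-1} v_+ b$ ($u_+ b \triangleright_{m-1,n-1} v_+ b$ if n = m).

If $a \notin \mathrm{alph}(u)$, and hence a = b, the a-left factorizations of ub and vb (resp. bu and
bv) are $u \, b \, \varepsilon$ and $v \, b \, \varepsilon$ (resp. $\varepsilon \, b \, u$ and $\varepsilon \, b \, v$), and we do have $u \triangleleft_{m-1,n-1} v$ and $u \triangleright_{m,n-1} v$
($u \triangleright_{m-1,n-1} v$ if m = n).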

Thus all the conditions in Proposition 3.6 (2) are satisfied, whether a occurs in u and v
or not, and we have established that $ub \triangleright_{m,n} vb$ and $bu \triangleright_{m,n} bv$. The proof regarding $\triangleleft_{m,n}$
is symmetric. □

Lemma 3.14. If $\varphi: A^* \to B^*$ is a morphism and if $u, v \in A^*$ are $\triangleright_{m,n}$-equivalent (resp.
$\triangleleft_{m,n}$-equivalent), then so are $\varphi(u)$ and $\varphi(v)$.

Proof. We carry out the proof for the congruence $\triangleright_{m,n}$ by induction on m + n. The proof
for $\triangleleft_{m,n}$ is symmetrical.

For m = 1, we show that if a ranker r € is condensed on (f{u), then it is condensed 
on (p{v). If u = ai ■ ■ ■ ai, the word (p{u) has a natural factorization in blocks, namely the 
(p{ai) and the sequence of positions in f{u) defined by the prefixes of r visits (some of) the 
99(ai)-blocks. This yields a factorization of r, r = rir2 ■ ■ - r^, where all the positions in ip{u) 
visited while running ri are in the same block, say, (p^aji^i-j); then all the positions visited 
by the prefixes of r between ri (excluded) and rir2 (included) are in the block 93(0^(2)) with 
j{2) > i(l); and so on. In particular, the ranker X^^.^^j • • • X^^.^^j is defined on n, and hence 
on V. Therefore v = foaj{i)'yi ' ' ' o,j(k)'^k+i- By construction, each is defined on ip{aj(^i-)), 
so r is defined on ip{v), and condensed on that word (Example 13. 3j) . 

We now let n ≥ m ≥ 2 and $u \triangleright_{m,n} v$. It is immediate that $\mathrm{alph}(\varphi(u)) = \mathrm{alph}(\varphi(v))$ since
u and v have the same alphabet. By Proposition 3.6 (2), we have $u \triangleleft_{m-1,n-1} v$, and by
induction it follows that $\varphi(u) \triangleleft_{m-1,n-1} \varphi(v)$. Let now $b \in \mathrm{alph}(\varphi(u))$ and let $\varphi(u) = x_- b x_+$
and $\varphi(v) = y_- b y_+$ be b-left factorizations. The occurrence of b thus singled out in $\varphi(u)$ sits
in some $\varphi(a)$, $a \in A$, and the corresponding occurrence of a in u is the leftmost one: we
have an a-left factorization $u = u_- a u_+$ and a b-left factorization $\varphi(a) = x' b x''$ such that
$x_- = \varphi(u_-) x'$ and $x_+ = x'' \varphi(u_+)$. Similarly, the leftmost occurrence of b in $\varphi(v)$ sits in
some $\varphi(a')$, $a' \in A$: a' is the leftmost letter in v such that b occurs in $\varphi(a')$. If a' ≠ a,
the consideration of the rankers $X_a Y_{a'}$ and $X_{a'} Y_a$, which are simultaneously defined or not
defined on u and v, yields a contradiction. Therefore a' = a and if $v = v_- a v_+$ is the a-left
factorization, then $y_- = \varphi(v_-) x'$ and $y_+ = x'' \varphi(v_+)$. By Proposition 3.6 (2) again, we have
$u_- \triangleleft_{m-1,n-1} v_-$ and $u_+ \triangleright_{m,n-1} v_+$ ($u_+ \triangleright_{m-1,n-1} v_+$ if n = m). By induction, it follows
that the same relations hold between the $\varphi$-images of $u_-$, $v_-$, $u_+$ and $v_+$, and we have
$x_- \triangleleft_{m-1,n-1} y_-$ and $x_+ \triangleright_{m,n-1} y_+$ ($x_+ \triangleright_{m-1,n-1} y_+$ if n = m) by Lemma 3.13. Therefore
$\varphi(u) \triangleright_{m,n} \varphi(v)$ by Proposition 3.6 (2). □

For each n ≥ m ≥ 1, let $\mathbf{R}_{m,n}$ (resp. $\mathbf{L}_{m,n}$) be the pseudovariety of monoids generated
by the monoids of the form $A^*/\triangleright_{m,n}$ (resp. $A^*/\triangleleft_{m,n}$). Since $\triangleright_{m,n'}$ refines $\triangleright_{m,n}$
when n' > n, the sequence $(\mathbf{R}_{m,n})_n$ is increasing and we let $\mathbf{R}_m$ be its union (a pseudovariety
as well). The pseudovariety $\mathbf{L}_m$ is defined similarly, as the union of the $\mathbf{L}_{m,n}$.

Corollary 3.15. If $\gamma: A^* \to M$ is a morphism into a monoid in $\mathbf{R}_{m,n}$, then there exists
a morphism $\beta: A^*/\triangleright_{m,n} \to M$ such that $\gamma = \beta \circ \pi_A$, where $\pi_A: A^* \to A^*/\triangleright_{m,n}$ is the
projection morphism. The same result holds for $\mathbf{L}_{m,n}$ and the quotient $A^*/\triangleleft_{m,n}$.

Proof. By definition, there exists an onto morphism $\delta: N \to M$, and an injective morphism
$\iota: N \to A_1^*/\triangleright_{m,n} \times \cdots \times A_k^*/\triangleright_{m,n}$. Let B be the disjoint union of the $A_i$, and for each i, let
$\pi_i$ be the morphism from $B^*$ to $A_i^*$ which erases all the letters not in $A_i$. By Lemma 3.14,
$\triangleright_{m,n}$-equivalent elements have $\triangleright_{m,n}$-equivalent images, so we have a morphism $\pi: B^*/\triangleright_{m,n} \to
\prod_i A_i^*/\triangleright_{m,n}$ as in Figure 2.

For each letter a G ^, we then pick a word (p{a) in 7r^^Tr~^6~'^^{a) C B*: this defines 
a morphism ip: A* ^ B* such that 6 o n o ttb o = 7. By Lemma 13.141 again, there exists 
a morphism ■0 : A* / \>rn,n—^ B* / \>m,n such that ijj o-ka = t^b ° V'- It follows that if u >rn,n v, 
then TTBf{u) = ttb^{v), and hence 7(n) = 7(f). This concludes the proof. □ 







Figure 2: A commutative diagram



Corollary 3.16. For each m ≥ 1, $\mathcal{R}_m$ and $\mathcal{L}_m$ are varieties of languages and the corresponding
pseudovarieties of monoids are $\mathbf{R}_m$ and $\mathbf{L}_m$.

Proof. Every $L_c(r)$ ($r \in \underline{R}^X_{m,n}$) is a union of $\triangleright_{m,n}$-classes, and hence it is recognized by
$A^*/\triangleright_{m,n}$. Therefore every language in $\mathcal{R}_m$ is recognized by a monoid in $\mathbf{R}_m$ (and indeed,
by $\pi_A: A^* \to A^*/\triangleright_{m,n}$ for n large enough).

Conversely, suppose that $L \subseteq A^*$ is recognized by a morphism $\gamma: A^* \to M$, into a
monoid $M \in \mathbf{R}_m$. Then $M \in \mathbf{R}_{m,n}$ for some n ≥ m and by Corollary 3.15, there exists a
morphism $\beta: A^*/\triangleright_{m,n} \to M$ such that $\gamma = \beta \circ \pi_A$. It follows that L is also accepted by $\pi_A$,
L is a union of $\triangleright_{m,n}$-classes, and hence $L \in \mathcal{R}_m$. □

Example 3.17. It follows from Proposition 3.6 (1) that $\mathcal{R}_1 = \mathcal{L}_1$ is the variety of piecewise
testable languages, and $\mathbf{R}_1 = \mathbf{L}_1 = \mathbf{J}$, the pseudovariety of $\mathcal{J}$-trivial monoids.

Remark 3.18. The proof of Corollary 3.16 also establishes that for each n ≥ m ≥ 1, the
Boolean algebra $\mathcal{R}_{m,n}$ generated by the languages of the form $L_c(r)$, $r \in \underline{R}^X_{m,n}$, defines a
variety of languages, for which the corresponding pseudovariety of monoids is $\mathbf{R}_{m,n}$. In
variety-theoretic terms, Corollary 3.15 states that $A^*/\triangleright_{m,n}$ is the free object of $\mathbf{R}_{m,n}$ over
the alphabet A. The symmetrical statement also holds for $\mathcal{L}_{m,n}$ and the monoids $A^*/\triangleleft_{m,n}$.

We note the following containments.

Corollary 3.19. For each m ≥ 1, $\mathbf{R}_m$ and $\mathbf{L}_m$ are contained in $\mathbf{DA}$, and also in
$\mathbf{R}_{m+1} \cap \mathbf{L}_{m+1}$.

Proof. Since $\mathcal{FO}^2$ is the variety of languages corresponding to $\mathbf{DA}$, Corollary 3.12 yields the
containment of $\mathbf{R}_m$ and $\mathbf{L}_m$ in $\mathbf{DA}$. Similarly, $\mathcal{R}_m$ and $\mathcal{L}_m$ are contained in both $\mathcal{R}_{m+1}$ and
$\mathcal{L}_{m+1}$ by definition of these classes of languages, and this in turn implies the containment
of the corresponding pseudovarieties. □



3.4. Condensed rankers and deterministic products. Recall that a product of languages
$L = L_0 a_1 L_1 \cdots a_k L_k$ (k ≥ 1, $a_i \in A$, $L_i \subseteq A^*$) is deterministic if, for 1 ≤ i ≤ k, each
word $u \in L$ has a unique prefix in $L_0 a_1 L_1 \cdots L_{i-1} a_i$. If for each i, the letter $a_i$ does not
occur in $L_{i-1}$, the product $L_0 a_1 L_1 \cdots a_k L_k$ is called visibly deterministic: this is obviously
a particular case of a deterministic product.
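
The uniqueness condition can be tested on individual words with a short sketch. It assumes, for illustration only, that each language $L_j$ is given as a Python regular expression and each $a_i$ as a single character; the function name `prefix_counts` is hypothetical. The function counts, for each i, the prefixes of a word that lie in $L_0 a_1 L_1 \cdots L_{i-1} a_i$; checking a finite sample of words of L in this way can refute determinism but cannot prove it.

```python
import re
from typing import List


def prefix_counts(word: str, langs: List[str], letters: List[str]) -> List[int]:
    """For i = 1..k, count the prefixes of `word` in L_0 a_1 L_1 ... L_{i-1} a_i.

    `langs` holds regular expressions for L_0, ..., L_k and `letters`
    holds the marker letters a_1, ..., a_k.
    """
    counts = []
    for i in range(1, len(letters) + 1):
        # Regular expression for the truncated product L_0 a_1 ... L_{i-1} a_i.
        pattern = "".join(
            f"(?:{langs[j]}){re.escape(letters[j])}" for j in range(i)
        )
        rx = re.compile(pattern)
        counts.append(
            sum(1 for p in range(len(word) + 1) if rx.fullmatch(word[:p]))
        )
    return counts
```

With `langs = ["[bc]*", "[ab]*"]` and `letters = ["a"]` (a visibly deterministic product, since `a` does not occur in the first factor), every word of the product yields the count 1. With `langs = ["[ab]*", "[ab]*"]` the word `aba` has two prefixes in the truncated product, so that product is not deterministic.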

The definition of a co-deterministic or visibly co-deterministic product is dual, in terms
of suffixes instead of prefixes. If $\mathcal{V}$ is a class of languages and A is a finite alphabet,
let $\mathcal{V}^{det}(A)$ (resp. $\mathcal{V}^{vdet}(A)$, $\mathcal{V}^{codet}(A)$, $\mathcal{V}^{vcodet}(A)$) be the set of all Boolean combinations of
languages of $\mathcal{V}(A)$ and of deterministic (resp. visibly deterministic, co-deterministic, visibly
co-deterministic) products of languages of $\mathcal{V}(A)$.

Figure 3: $M \in \mathbf{K} \mathbin{ⓜ} \mathbf{L}_m$ (commutative square with $A^* \to A^*/\triangleleft_{m,n}$, $\gamma$, $\beta$ and $M \to M/{\sim_K}$)

Pin gave algebraic characterizations of the operations $\mathcal{V} \mapsto \mathcal{V}^{det}$ and $\mathcal{V} \mapsto \mathcal{V}^{codet}$

[iai22]. 

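Proposition 3.20. If $\mathcal{V}$ is a variety of languages and if $\mathbf{V}$ is the corresponding pseudovariety
of monoids, then $\mathcal{V}^{det}$ and $\mathcal{V}^{codet}$ are varieties of languages and the corresponding
pseudovarieties are, respectively, $\mathbf{K} \mathbin{ⓜ} \mathbf{V}$ and $\mathbf{D} \mathbin{ⓜ} \mathbf{V}$.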

This leads to the following statement. 

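Theorem 3.21. For each m ≥ 1, we have $\mathcal{R}_{m+1} = \mathcal{L}_m^{det} = \mathcal{L}_m^{vdet}$, $\mathcal{L}_{m+1} = \mathcal{R}_m^{codet} = \mathcal{R}_m^{vcodet}$,
$\mathbf{R}_{m+1} = \mathbf{K} \mathbin{ⓜ} \mathbf{L}_m$ and $\mathbf{L}_{m+1} = \mathbf{D} \mathbin{ⓜ} \mathbf{R}_m$. In particular, $\mathbf{R}_2 = \mathbf{R}$ and $\mathbf{L}_2 = \mathbf{L}$.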

The proof uses the following technical property of monoids in DA, whose proof can be 
found for instance in [BJ Lemma 4.2]. 

Fact 3.22. Let a: A* ^ She a. morphism into a monoid S G DA. If n, u E A*, a G alph(f ) 

and (y{u) TZ a{uv), then a{uva) TZ a{u). 

Proof of Theorem 3.21. It is immediate from the definition that $\mathcal{L}_m^{vdet} \subseteq \mathcal{L}_m^{det}$.

Let $u \in A^*$ and let $B = \mathrm{alph}(u)$. For each $a \in B$, let $u = u_-^{(a)} a u_+^{(a)}$ be the a-left
factorization of u. Let [B] be the language of all strings with alphabet B, $[B] = \{u \in A^* \mid
\mathrm{alph}(u) = B\}$. Observe that

$$[B] = \bigcap_{a \in B} L_c(X_a) \setminus \bigcup_{a \notin B} L_c(X_a) = \bigcap_{a \in B} L_c(Y_a) \setminus \bigcup_{a \notin B} L_c(Y_a).$$

This shows that $[B] \in \mathcal{R}_1 = \mathcal{L}_1$. (It is also well-known that [B] is piecewise testable, and
hence $[B] \in \mathcal{R}_1 = \mathcal{L}_1$.)

Now let n > m ≥ 1. It follows from Proposition 3.6 that the $\triangleright_{m+1,n}$-class of u is the
intersection of [B], the $\triangleleft_{m,n-1}$-class of u and the products KaL ($a \in B$) where K is the
$\triangleleft_{m,n-1}$-class of $u_-^{(a)}$ and L is the $\triangleright_{m+1,n-1}$-class of $u_+^{(a)}$ if n > m + 1, the $\triangleright_{m,n-1}$-class of
$u_+^{(a)}$ if n = m + 1.

By definition of an a-left factorization, each of these products is visibly deterministic 
and, since every <im^„_i-class is a lang class of 

u is in $\mathcal{L}_m^{vdet}$. Thus $\mathcal{R}_{m+1} \subseteq \mathcal{L}_m^{vdet}$.

To establish the last inclusion, namely $\mathcal{L}_m^{det} \subseteq \mathcal{R}_{m+1}$, we rather show $\mathbf{K} \mathbin{ⓜ} \mathbf{L}_m \subseteq \mathbf{R}_{m+1}$.
Let $\gamma: A^* \to M$ be a surjective morphism, onto a monoid $M \in \mathbf{K} \mathbin{ⓜ} \mathbf{L}_m$: we want
to show that there exists a morphism from $A^*/\triangleright_{m+1,n}$ onto M for some n > m. Since
$M \in \mathbf{K} \mathbin{ⓜ} \mathbf{L}_m$, the monoid $M/{\sim_K}$ is in $\mathbf{L}_m$ and by Corollary 3.15, there exists an integer n
and a morphism $\beta: A^*/\triangleleft_{m,n} \to M/{\sim_K}$ such that $\beta \circ \alpha = \pi \circ \gamma$, where $\alpha$ is the projection
from $A^*$ onto $A^*/\triangleleft_{m,n}$ and $\pi$ is the projection from M onto $M/{\sim_K}$, see Figure 3.

Let ℓ be the maximal length of a strict $\mathcal{R}$-chain in M, that is: if $x_k <_{\mathcal{R}} \cdots <_{\mathcal{R}} x_1$ in
M, then k ≤ ℓ. We show that, for any $u, v \in A^*$,

$$u \triangleright_{m+1,\,\ell|A|+n+1} v \implies \gamma(u) = \gamma(v). \qquad (3.1)$$

If n' = ℓ|A| + n + 1, this implies the existence of a morphism from $A^*/\triangleright_{m+1,n'}$ onto M,
as announced.

To prove implication (3.1), it suffices to show that we have

$$u \triangleright_{m+1,\,\ell|\mathrm{alph}(u)|+n+1} v \implies \gamma(u) = \gamma(v), \qquad (3.2)$$

which we prove by induction on $|\mathrm{alph}(u)|$. If $|\mathrm{alph}(u)| = 0$, then $u = \varepsilon$, $\mathrm{alph}(v) = \emptyset$ and
$v = \varepsilon$ as well, so that $\gamma(u) = \gamma(v)$.

Now suppose that $u \ne \varepsilon$ and assume that $u \triangleright_{m+1,\,\ell|\mathrm{alph}(u)|+n+1} v$. Let $u = u_1 a_1 \cdots a_k u_{k+1}$
be the factorization of u such that each $u_i$ is a word, each $a_i$ is a letter and

$$1 \mathrel{\mathcal{R}} \gamma(u_1) >_{\mathcal{R}} \gamma(u_1 a_1) \cdots >_{\mathcal{R}} \gamma(u_1 a_1 \cdots u_k a_k) \mathrel{\mathcal{R}} \gamma(u_1 a_1 \cdots a_k u_{k+1}).$$

Then k + 1 ≤ ℓ, so k < ℓ. Moreover, by Fact 3.22 (and Corollary 3.19), for each 1 ≤ i ≤ k,
$a_i \notin \mathrm{alph}(u_i)$, so that each product $u_i a_i (u_{i+1} \cdots a_k u_{k+1})$ is an $a_i$-left factorization (1 ≤ i ≤ k).
An easy induction on k, using Lemma 3.7, shows that v can then be factored as

$$v = v_1 a_1 v_2 \cdots a_k v_{k+1},$$

where $u_i \triangleright_{m+1,\,\ell|\mathrm{alph}(u)|+n-i+1} v_i$ for each 1 ≤ i ≤ k+1. Moreover, for 1 ≤ i ≤ k, $|\mathrm{alph}(u_i)| <
|\mathrm{alph}(u)|$. Since i ≤ k < ℓ, we have $\ell|\mathrm{alph}(u)| + n - i \ge \ell|\mathrm{alph}(u_i)| + n + 1$, and by induction,
we have $\gamma(u_i) = \gamma(v_i)$. However, it is possible that $\mathrm{alph}(u_{k+1}) = \mathrm{alph}(u)$, so we cannot
conclude that $\gamma(u_{k+1}) = \gamma(v_{k+1})$.
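But we do have the following:

$$u_{k+1} \triangleright_{m+1,\,\ell|\mathrm{alph}(u)|+n-k} v_{k+1} \quad\text{and}\quad \gamma(u') = \gamma(v'),$$

where $u' = u_1 a_1 \cdots u_k a_k$ and $v' = v_1 a_1 \cdots v_k a_k$. The first relation implies that $u_{k+1}$ and
$v_{k+1}$ are $\triangleleft_{m,\,\ell|\mathrm{alph}(u)|+n-k-1}$-equivalent. Since k < ℓ, we have $\ell|\mathrm{alph}(u)| + n - k - 1 \ge n$, so
$u_{k+1} \triangleleft_{m,n} v_{k+1}$ and hence, $\pi\gamma(u_{k+1}) = \pi\gamma(v_{k+1})$, that is, $\gamma(u_{k+1}) \sim_K \gamma(v_{k+1})$.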

Moreover, there exists a string $x \in A^*$ such that $\gamma(u') = \gamma(u' u_{k+1} x)$. Let $\omega$ be an
integer such that every $\omega$-power is idempotent in M: then $\gamma(u') = \gamma(u') \gamma(u_{k+1} x)^\omega$.

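Now observe that $\gamma(u_{k+1} x)^\omega \mathrel{\mathcal{R}} \gamma(u_{k+1} x)^\omega \gamma(u_{k+1})$, since $\gamma(u_{k+1} x)^\omega = \gamma(u_{k+1} x)^{2\omega}$. It
follows from $\gamma(u_{k+1}) \sim_K \gamma(v_{k+1})$ that $\gamma(u_{k+1} x)^\omega \gamma(u_{k+1}) = \gamma(u_{k+1} x)^\omega \gamma(v_{k+1})$. Therefore
we have

$\gamma(u') \gamma(u_{k+1}) = \gamma(u') \gamma(v_{k+1})$ and hence

$$\gamma(u) = \gamma(u') \gamma(u_{k+1}) = \gamma(u') \gamma(v_{k+1}) = \gamma(v') \gamma(v_{k+1}) = \gamma(v).$$
This concludes the proof of Formula (3.2), and therefore of Theorem 3.21. □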

3.5. Structure of the $\mathbf{R}_m$ and $\mathbf{L}_m$ hierarchies. It turns out that the hierarchies of
pseudovarieties given by the $\mathbf{R}_m$ and the $\mathbf{L}_m$ were studied in the semigroup-theoretic
literature (Trotter and Weil [35], Kufleitner and Weil [15]). In [15], they are defined as the
hierarchies of pseudovarieties obtained from $\mathbf{J}$ by alternated applications of the operations
$\mathbf{X} \mapsto \mathbf{K} \mathbin{ⓜ} \mathbf{X}$ and $\mathbf{X} \mapsto \mathbf{D} \mathbin{ⓜ} \mathbf{X}$. Theorem 3.21 shows that these are the same hierarchies as
those considered in this paper. The following results are proved in [15, Section 4].

Proposition 3.23. The hierarchies $(\mathbf{R}_m)_m$ and $(\mathbf{L}_m)_m$ are infinite chains of decidable
pseudovarieties, and their unions are equal to $\mathbf{DA}$. Moreover, every m-generated monoid
in $\mathbf{DA}$ lies in $\mathbf{R}_{m+1} \cap \mathbf{L}_{m+1}$.

The results in [35, 15] go actually further, and give defining pseudoidentities for the
pseudovarieties $\mathbf{R}_m$ and $\mathbf{L}_m$.

Remark 3.24. The way up in the $\mathbf{R}_m$-$\mathbf{L}_m$ hierarchy, by means of Mal'cev products with
$\mathbf{K}$ and $\mathbf{D}$, is strongly reminiscent of the structure of the lattice of band varieties [8]. This
observation is no coincidence, and forms the basis of the results in p5] which are used here. 



4. The $\mathbf{R}_m$ hierarchy and unary temporal logic

We have seen in Corollary 3.12 that $\mathcal{TL}^X_m \subseteq \mathcal{R}_m$ and $\mathcal{TL}^Y_m \subseteq \mathcal{L}_m$. In Theorem 4.3 below,
we prove a weak converse. Let us however make the following observation.

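Proposition 4.1. We have

$$\mathcal{TL}^X_1 = \mathcal{TL}^Y_1 = \mathcal{R}_1 = \mathcal{L}_1, \qquad \mathcal{TL}^X_2 = \mathcal{R}_2, \qquad \mathcal{TL}^Y_2 = \mathcal{L}_2.$$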

Proof. The statement concerning $\mathcal{TL}_1$ was already proved in Remark 1.7. Let us now
establish that $\mathcal{R}_2 \subseteq \mathcal{TL}^X_2$. We show, by induction on n ≥ 2, that if u and v agree on
rankers in $\underline{R}^X_{2,2n}$, then they agree on condensed rankers in $\underline{R}^X_{2,n}$: $u \triangleright_{2,n} v$. We use the
characterization of $\triangleright_{2,n}$ in Proposition 3.6.

The consideration of 1-letter rankers shows that $\mathrm{alph}(u) = \mathrm{alph}(v)$. Moreover, since
$\underline{R}^Y_{1,n-1}$ is contained in $\underline{R}^X_{2,2n}$, and since these rankers are condensed where they are defined,
we find that $u \triangleleft_{1,n-1} v$. Similarly, let $u = u_- a u_+$ and $v = v_- a v_+$ be a-left factorizations,
and let $s \in \underline{R}^Y_{1,n-1}$. Then s is condensed on $u_-$ if and only if s is defined on $u_-$, if and only
if $X_a s$ is defined on u (Lemma 3.5). Since $X_a s \in \underline{R}^X_{2,2n}$ and u, v agree on such rankers,
it follows that $X_a s$ is defined on v, and s is condensed on $v_-$. Thus $u_- \triangleleft_{1,n-1} v_-$.

Now we need to show that $u_+ \triangleright_{2,n-1} v_+$ if n ≥ 3, $u_+ \triangleright_{1,1} v_+$ if n = 2. Suppose first
that n = 2 and consider $s \in \underline{R}^X_{1,1}$ condensed on $u_+$. Then $s = X_b$ for some $b \in A$ and the
consideration of $r = X_a X_b$ (in $\underline{R}^X_{2,2}$) shows that s is condensed on $v_+$ as well. This settles
the case n = 2.

Let us now assume that n ≥ 3 and let us show that $u_+ \triangleright_{2,n-1} v_+$. By induction, it
suffices to show that $u_+$ and $v_+$ agree on rankers in $\underline{R}^X_{2,2n-2}$. So let $s \in \underline{R}^X_{2,2n-2}$ be defined on
u+. Then for every prefix p of s ending with a Y-letter, XapYa is defined on u (Lemma [33]). 
Since $X_a p Y_a \in \underline{R}^X_{2,2n}$, it follows that $X_a p Y_a$ is defined on v, and hence s is defined on $v_+$.
This concludes the proof. □

Footnote to Section 3.5: More precisely, the pseudovarieties $\mathbf{R}_m$ and $\mathbf{L}_m$ in [15] are pseudovarieties of semigroups, and the $\mathbf{R}_m$
and $\mathbf{L}_m$ considered in this paper are the classes of monoids in these pseudovarieties.

Example 4.2 below shows that the statement of Proposition 4.1 cannot be extended to
the higher levels of the hierarchy.

Example 4.2. We show in this example that $\mathcal{TL}^X_3$ is properly contained in $\mathcal{R}_3$. More
precisely, let $r_0 = X_a Y_b X_c \in R^X_{3,3}$. We show that $L_c(r_0)$, a language in $\mathcal{R}_3$, is not
$\mathcal{TL}^X_3$-definable.

Let $u_n = (bc)^n (a(bc)^n)^n$ and $v_n = (bc)^n b (a(bc)^n)^n$. It is easily verified that $r_0$
is condensed on $u_n$, and that it is defined and not condensed on $v_n$: that is, for each n,
$u_n \in L_c(r_0)$ and $v_n \notin L_c(r_0)$.

We now show that $u_n$ and $v_n$ agree on all rankers in $\underline{R}_{3,n}$, so that any $\mathcal{TL}^X_3$-definable
language contains either both $u_n$ and $v_n$, or neither, and hence $L_c(r_0)$ is not $\mathcal{TL}^X_3$-definable.

Let $r \in \underline{R}_{3,n}$. If r starts with a Y-letter, then any two words ending with $(a(bc)^n)^n$
agree on r. In particular, $u_n$ and $v_n$ agree on r. Similarly, if r starts with an X-letter and
does not contain the letters $X_a$ or $Y_a$, then any two words starting with $(bc)^n$ agree on r,
so $u_n$ and $v_n$ agree on r.

Finally, assume that r starts with an X-letter and that $r = s_0 Z^{(1)}_a s_1 \cdots Z^{(k)}_a s_k$ with
k > 0, each $Z^{(i)} \in \{X, Y\}$ and each $s_i$ a (possibly empty) ranker avoiding the letters $X_a$ and
$Y_a$. We denote by $p_i$ the prefix $p_i = s_0 Z^{(1)}_a s_1 \cdots Z^{(i)}_a$.

Suppose first that $r \in R^X_{1,n}$. Then $p_i$ coincides with $X_a^i$ on $u_n$ as well as on $v_n$. Therefore
r is defined and coincides with $X_a^k s_k$ on both words.

Suppose now that r G -R2 m say ^ = 'r''^" with r' a non-empty string of X-letters and r" 
a non-empty string of Y-letters. If r' is shorter than pi, then Z^^^ = Y and r is not defined 
on either Un or Vn- If Z^^^ = X, let i be maximal such that pi is a prefix of r', say r' = Pis[. 
Then i > and pi coincides with X^ on as well as on Vn- 

If i = fc, then r is defined on m„ and f„, and it coincides with X^Sfc on both words. 

If 1 < i < /c, s- is non-empty and is defined on (6c)", then pj+i coincides with X^ 
on Un and Vn- Thus r is defined on Un (resp. f,i) if and only i > k — i, and in that case, it 
coincides with X^~^*Sfc. 

If 1 < i < k, s'^ is non-empty and Si is not defined on (6c)", or if s- is empty, then p^+i 
coincides with X^~^ on Un and Thus r is defined on u„ (resp. Vn) if and only i > k — i, 
and in that case, it coincides with X^~^*~^Sfc. 

Finally, if 1 = i < fc, s- is non-empty and is not defined on (6c)", or if s- is empty, 
then r (and even Pi+i) is not defined on either u„ or Vn- 

Finally, let us assume that r G -R3 say r = r'r"r"' with r' and r'" non-empty strings 
of X-letters and r" a non-empty string of Y-letters. Again, let i be maximal such that pi is 
a prefix of r' (i = if pi is not a prefix of r') and let j be maximal such that pj is a prefix 
of r'r". Then r'r" = pjs'j for some prefix s'j of sj. By the previous analysis, if i < 1 < j, 
then r'r" is not defined on Un nor on and hence neither is r. In all other cases, r'r" is 
defined on both words and coincides with Y^~^ s'^ or y^~-'~^s'-. Since {k — j) + {2i — j) < k, 

r is defined on Un and Un, and coincides on these words with Xa~-'^^''~-'sfc or y<a~'^~^'^^~''~^Sk- 
To conclude this example, note that $u_n$ and $v_n$ disagree on rankers in $\underline{R}^X_{4,4}$. More precisely,
the ranker $X_a Y_c X_b Y_a$ is defined on $u_n$ but not on $v_n$. Further getting ahead of ourselves, we
note that this example also shows (in view of Theorem 5.1) that $\mathcal{TL}^X_3$ is properly contained
in $\mathcal{FO}^2_3$.
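
The two concrete claims of Example 4.2 about $r_0 = X_a Y_b X_c$ and $X_a Y_c X_b Y_a$ can be checked mechanically for small n. The sketch below is an illustration under stated assumptions: positions are 1-based, $X_a$ moves to the nearest a-position strictly to the right and $Y_a$ to the nearest one strictly to the left, and condensedness is read as "each move stays strictly inside the open interval delimited by the previously visited positions", which is one reading of the definition given earlier in the paper and is consistent with Lemma 3.8. The names `step`, `evaluate` and `is_condensed` are hypothetical.

```python
from typing import List, Optional, Tuple

Ranker = List[Tuple[str, str]]  # e.g. [("X", "a"), ("Y", "b")] stands for X_a Y_b


def step(u: str, pos: int, d: str, a: str) -> Optional[int]:
    """Nearest a-position strictly right (d == "X") or left (d == "Y") of pos."""
    rng = range(pos + 1, len(u) + 1) if d == "X" else range(pos - 1, 0, -1)
    for q in rng:
        if u[q - 1] == a:
            return q
    return None


def evaluate(r: Ranker, u: str) -> Optional[int]:
    """Position r(u), or None if r is not defined on u."""
    pos = 0 if r[0][0] == "X" else len(u) + 1
    for d, a in r:
        pos = step(u, pos, d, a)
        if pos is None:
            return None
    return pos


def is_condensed(r: Ranker, u: str) -> bool:
    """Assumed reading: every move lands strictly inside the current open interval."""
    lo, hi = 0, len(u) + 1
    pos = 0 if r[0][0] == "X" else len(u) + 1
    for k, (d, a) in enumerate(r):
        pos = step(u, pos, d, a)
        if pos is None or not (lo < pos < hi):
            return False
        if k + 1 < len(r):
            # The next direction decides which end of the interval shrinks.
            if r[k + 1][0] == "X":
                lo = pos
            else:
                hi = pos
    return True


def u_word(n: int) -> str:
    return "bc" * n + ("a" + "bc" * n) * n


def v_word(n: int) -> str:
    return "bc" * n + "b" + ("a" + "bc" * n) * n


R0 = [("X", "a"), ("Y", "b"), ("X", "c")]
R4 = [("X", "a"), ("Y", "c"), ("X", "b"), ("Y", "a")]
```

On $v_n$ the last move of $r_0$ jumps past the first a-position, which is exactly the failure pattern described in Lemma 3.8, while on $u_n$ it stays to its left.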

Finally we prove a result on the containment of the $\mathcal{R}_m$ and $\mathcal{L}_m$ hierarchies in the $\mathcal{TL}_m$
hierarchy.

Theorem 4.3. Let m ≥ 1. Then $\mathcal{R}_m \subseteq \mathcal{TL}^X_{2m-1}$ and $\mathcal{L}_m \subseteq \mathcal{TL}^Y_{2m-1}$.

More precisely, for all n ≥ m, $Z \in \{X, Y\}$ and $u, v \in A^*$, if u and v agree on rankers in
$\underline{R}^Z_{2m-1,2n-1}$, then they agree on condensed rankers in $\underline{R}^Z_{m,n}$.

Proof. Without loss of generality, we may assume Z = X. The proof is by induction on m.
The result is trivial if m = 1, since 2m − 1 = 1 and 2n − 1 ≥ n. We now assume that m ≥ 2
and u, v agree on rankers in $\underline{R}^X_{2m-1,2n-1}$.

We use the characterization of $\triangleright_{m,n}$ in Proposition 3.6: the consideration of length
1 rankers shows that $\mathrm{alph}(u) = \mathrm{alph}(v)$. Since $\underline{R}^Y_{2m-3,2n-3}$ is contained in $\underline{R}^X_{2m-1,2n-1}$,
we have $u \triangleleft_{m-1,n-1} v$ by induction. Now, for each letter $a \in \mathrm{alph}(u)$, let $u = u_- a u_+$
and $v = v_- a v_+$ be the a-left factorizations. We want to show that $u_- \triangleleft_{m-1,n-1} v_-$ and
$u_+ \triangleright_{m,n-1} v_+$ ($u_+ \triangleright_{m-1,n-1} v_+$ if m = n). By induction, it suffices to show that $u_-$ and $v_-$
agree on rankers in $\underline{R}^Y_{2m-3,2n-3}$, and $u_+$ and $v_+$ agree on rankers in $\underline{R}^X_{2m-1,2n-3}$ ($\underline{R}^X_{2m-3,2n-3}$
if m = n). In the rest of the proof we silently rely on the results of Lemma 3.5.

Let $s \in \underline{R}^Y_{2m-3,2n-3}$ be defined on $u_-$. If s starts with a Y-block, then $X_a s \in \underline{R}^X_{2m-2,2n-2}$
and $X_a s$ is defined on u. Moreover, if p is any prefix of s, then $X_a p Y_a \in \underline{R}^X_{2m-1,2n-1}$ is not
defined on u. It follows that s is defined on $v_-$.

If instead s starts with an X-block, then $s \in \underline{R}^X_{2m-4,2n-4}$ and s is defined on u. If p is
any prefix of s, then $p Y_a \in \underline{R}^X_{2m-3,2n-3}$ and $p Y_a$ is not defined on u. As all these rankers
are in $\underline{R}^X_{2m-1,2n-1}$, the same holds on v and s is defined on $v_-$.

Let now $s \in \underline{R}^X_{2m-1,2n-3}$ ($\underline{R}^X_{2m-3,2n-3}$ if n = m) be defined on $u_+$. If s starts with
an X-block, then $X_a s \in \underline{R}^X_{2m-1,2n-2}$ ($X_a s \in \underline{R}^X_{2m-3,2n-2}$ if n = m) and $X_a s$ is defined on
u. Moreover, for each prefix p of s ending with a Y-letter, $X_a p Y_a \in \underline{R}^X_{2m-1,2n-1}$
($\underline{R}^X_{2m-3,2n-1}$ if n = m) and $X_a p Y_a$ is defined on u. As all these rankers are in $\underline{R}^X_{2m-1,2n-1}$,
the same holds on v and s is defined on $v_+$.

If instead s starts with a Y-block, then $s \in \underline{R}^Y_{2m-2,2n-4}$ ($\underline{R}^Y_{2m-4,2n-4}$ if n = m) and
s is defined on u. Moreover, if p is any prefix of s ending with a Y-letter, $p Y_a \in \underline{R}^Y_{2m-2,2n-3}$
($p Y_a \in \underline{R}^Y_{2m-4,2n-3}$ if n = m) and $p Y_a$ is defined on u. As all these rankers are in $\underline{R}^X_{2m-1,2n-1}$,
the same holds on v and s is defined on $v_+$. □

The containment of $\mathcal{R}_m$ and $\mathcal{L}_m$ into $\mathcal{TL}^X_{2m-1}$ and $\mathcal{TL}^Y_{2m-1}$, respectively, is not very
precise, unfortunately, especially in view of Theorem 6.4 below.

5. The $\mathbf{R}_m$ hierarchy and $\mathcal{FO}^2_m$
The objective of this section is to prove the following theorem.

Theorem 5.1. Let m ≥ 1. Every language in $\mathcal{R}_m$ or $\mathcal{L}_m$ is $\mathcal{FO}^2_m$-definable, and every
$\mathcal{FO}^2_m$-definable language is in $\mathcal{R}_{m+1} \cap \mathcal{L}_{m+1}$. Equivalently, we have

$$\mathbf{R}_m \vee \mathbf{L}_m \subseteq \mathbf{FO}^2_m \subseteq \mathbf{R}_{m+1} \cap \mathbf{L}_{m+1},$$

where $\mathbf{V} \vee \mathbf{W}$ denotes the least pseudovariety containing $\mathbf{V}$ and $\mathbf{W}$.

5.1. Are the containments in Theorem 5.1 strict? In the particular case where m = 1,
we know that $\mathbf{R}_2 \cap \mathbf{L}_2 = \mathbf{R} \cap \mathbf{L} = \mathbf{J} = \mathbf{R}_1 \vee \mathbf{L}_1$: this reflects the fact that $\mathcal{FO}^2_1$ is the class
of piecewise testable languages. However, we conjecture that this equality does not hold
for larger values of m.

Conjecture 5.2. For m ≥ 2, $\mathbf{R}_m \vee \mathbf{L}_m$ is properly contained in $\mathbf{R}_{m+1} \cap \mathbf{L}_{m+1}$.

The following example proves the conjecture for m = 2.
Example 5.3. $L = \{b,c\}^* ca \{a,b\}^*$ is $\mathcal{FO}^2_2$-definable, by the following formula:



The words $u_n = (bc)^n (ab)^n$ are in L, while the words $v_n = (bc)^n b (ab)^n$ are not. Almeida and
Azevedo showed that $\mathbf{R}_2 \vee \mathbf{L}_2$ is defined by the pseudo-identity $(bc)^\omega (ab)^\omega = (bc)^\omega b (ab)^\omega$
([2, Theorem 9.2.13 and Exercise 9.2.15]). In particular, for each language K recognized
by a monoid in $\mathbf{R}_2 \vee \mathbf{L}_2$, the words $u_n$ and $v_n$ (for n large enough) are all in K, or all in
the complement of K. Therefore L is not recognized by such a monoid, which proves that
$\mathbf{R}_2 \vee \mathbf{L}_2$ is strictly contained in $\mathbf{FO}^2_2$, and hence also in $\mathbf{R}_3 \cap \mathbf{L}_3$. It also shows that $\mathcal{TL}^X_2$ is
properly contained in $\mathcal{FO}^2_2$.
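
The claims of Example 5.3 lend themselves to a brute-force check. The sketch below assumes that the displayed formula is the conjunction of three clauses: there is a c-position with no a before it and no c after it, there is an a-position with no a before it and no c after it, and every b-position has an a before it or a c after it. It compares this formula with the regular expression for L on all short words, taking $v_n = (bc)^n b (ab)^n$, the word matching the right-hand side of the pseudo-identity. The names `formula`, `u_word` and `v_word` are hypothetical.

```python
import re
from itertools import product

# L = {b,c}* c a {a,b}* as a regular expression.
L = re.compile(r"[bc]*ca[ab]*")


def formula(w: str) -> bool:
    """Evaluate the assumed FO^2 formula of Example 5.3 on the word w."""
    n = range(len(w))
    last_c = any(
        w[i] == "c"
        and all(w[j] != "a" for j in n if j < i)
        and all(w[j] != "c" for j in n if j > i)
        for i in n
    )
    first_a = any(
        w[i] == "a"
        and all(w[j] != "a" for j in n if j < i)
        and all(w[j] != "c" for j in n if j > i)
        for i in n
    )
    every_b = all(
        w[i] != "b"
        or any(w[j] == "a" for j in n if j < i)
        or any(w[j] == "c" for j in n if j > i)
        for i in n
    )
    return last_c and first_a and every_b


def u_word(n: int) -> str:
    return "bc" * n + "ab" * n


def v_word(n: int) -> str:
    return "bc" * n + "b" + "ab" * n
```

Each clause quantifies over one position i and, inside it, over one further position j, which mirrors the two-variable, two-block shape of the formula.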

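Finally, we formulate the following conjecture.

Conjecture 5.4. For each m ≥ 1, $\mathbf{FO}^2_m = \mathbf{R}_{m+1} \cap \mathbf{L}_{m+1}$.

5.2. Proof of Theorem 5.1. Corollary 3.12 already established that every language in
$\mathcal{R}_m$ or $\mathcal{L}_m$ is $\mathcal{FO}^2_m$-definable.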



In view of Theorem 3.11, to establish that $\mathcal{FO}^2_m$ is contained in $\mathcal{R}_{m+1} \cap \mathcal{L}_{m+1}$, it suffices
to prove the following result.

For each n ≥ m ≥ 1, if $u \triangleright_{m+1,2n} v$ or $u \triangleleft_{m+1,2n} v$, then Properties (WI
1c), (WI 2c) and (WI 3c) hold for m, n.

The result is trivial if m = 1, since in that case, only Property (WI 1c) is non-vacuous.

So we now assume that m ≥ 2, and $u \triangleright_{m+1,2n} v$ or $u \triangleleft_{m+1,2n} v$. Property (WI 1c) holds
trivially, by definition of the $\triangleright_{m+1,2n}$ and $\triangleleft_{m+1,2n}$ relations. We now concentrate on proving
that Properties (WI 2c) and (WI 3c) also hold for m, n, a task that will be completed in
Section 5.2.3.

5.2.1. The case where r and r' start with opposite directions.

Proposition 5.5. Let n ≥ m ≥ 1, $r = Y_{a_1} s \in \underline{R}^Y_{m,n}$ and $r' = X_c$. If $u, v \in A^*$, r is condensed
on u and v and $u \triangleright_{m,n+1} v$ or $u \triangleleft_{m+1,n+1} v$, then $\mathrm{ord}(r(u), r'(u)) = \mathrm{ord}(r(v), r'(v))$.
The dual statement (involving $r = X_{a_1} s \in \underline{R}^X_{m,n}$ and $r' = Y_c$) holds as well.

(Footnote.) Of course, the same fact can be proved by the direct construction of an $\mathrm{FO}^2_m$-formula for each $\triangleright_{m,n}$-class (by induction on $m$ and using Proposition 3.6).



Formula of Example 5.3:

$$\begin{aligned}
& \exists i\ \bigl(c(i) \wedge (\forall j\ (j < i \rightarrow \neg a(j))) \wedge (\forall j\ (j > i \rightarrow \neg c(j)))\bigr) \\
\wedge\ & \exists i\ \bigl(a(i) \wedge (\forall j\ (j < i \rightarrow \neg a(j))) \wedge (\forall j\ (j > i \rightarrow \neg c(j)))\bigr) \\
\wedge\ & \forall i\ \bigl(b(i) \rightarrow (\exists j\ (j < i \wedge a(j))) \vee (\exists j\ (j > i \wedge c(j)))\bigr).
\end{aligned}$$

Proof. First suppose that $u \triangleleft_{m+1,n+1} v$, that is, $u$ and $v$ agree on condensed rankers in $\underline{R}^Y_{m+1,n+1}$. We are in exactly one of the following three situations:

- $rY_c$ is defined on $u$, in which case $r'(u) < r(u)$;

- $rY_c$ is undefined on $u$ and $c$ is the last letter to occur in $r$, in which case $r'(u) = r(u)$;

- $rY_c$ is undefined on $u$ and $c$ is not the last letter to occur in $r$, in which case $r(u) < r'(u)$.

The same trichotomy holds for $v$. Since $rY_c \in \underline{R}^Y_{m+1,n+1}$, $u$ and $v$ agree on $rY_c$ (Proposition 3.10), and hence $\mathrm{ord}(r(u), r'(u)) = \mathrm{ord}(r(v), r'(v))$.

Let us now assume that $u \triangleright_{m,n+1} v$, so that $u$ and $v$ agree on condensed rankers in $\underline{R}^X_{m,n+1}$. If $m = 1$ then $r$ is of the form $r = Y_{a_1} \cdots Y_{a_k}$, and we observe again that

- either $X_c X_{a_k} \cdots X_{a_1} \in \underline{R}^X_{1,n+1}$ is defined on $u$, and we have $r'(u) < r(u)$;

- or $X_c X_{a_k} \cdots X_{a_1}$ is undefined on $u$ and $c = a_k$, and we have $r'(u) = r(u)$;

- or $X_c X_{a_k} \cdots X_{a_1}$ is undefined on $u$ and $c \ne a_k$, and we have $r'(u) > r(u)$.

The same holds for $v$ since $X_c X_{a_k} \cdots X_{a_1} \in \underline{R}^X_{1,n+1}$ and such rankers are condensed where they are defined. Therefore we have $\mathrm{ord}(r(u), r'(u)) = \mathrm{ord}(r(v), r'(v))$.

We now assume that $m \ge 2$. Let $u = u_- c u_+$ and $v = v_- c v_+$ be $c$-left factorizations. We distinguish two cases depending on the direction of the second letter of $r$.

First suppose that $r = Y_{a_1} Y_{a_2} s'$. If $a_1 \notin \mathrm{alph}(u_+)$, then $r(u) < r'(u)$ (because $r$ is condensed on $u$). Since $u_+ \triangleright_{m,n} v_+$, we have $\mathrm{alph}(u_+) = \mathrm{alph}(v_+)$, so $r(v) < r'(v)$ as well. If instead $a_1 \in \mathrm{alph}(u_+) = \mathrm{alph}(v_+)$, let $u_+ = u_0 a_1 u_1$ and $v_+ = v_0 a_1 v_1$ be the $a_1$-right factorizations. Then

$$\mathrm{ord}(r(u), r'(u)) = \mathrm{ord}(Y_{a_2} s'(u_- c u_0), r'(u_- c u_0)) \quad\text{and}$$
$$\mathrm{ord}(r(v), r'(v)) = \mathrm{ord}(Y_{a_2} s'(v_- c v_0), r'(v_- c v_0)).$$

Since $(u_- c u_0) a_1 u_1$ and $(v_- c v_0) a_1 v_1$ are $a_1$-right factorizations as well, we deduce from Lemma 3.7 that $u_- c u_0 \triangleright_{m,n} v_- c v_0$ and it follows by induction on the length of $r$ that

$$\mathrm{ord}(Y_{a_2} s'(u_- c u_0), r'(u_- c u_0)) = \mathrm{ord}(Y_{a_2} s'(v_- c v_0), r'(v_- c v_0)).$$

The other case is $r = Y_{a_1} X_{b_1} s'$. If $a_1 \in \mathrm{alph}(c u_+) = \mathrm{alph}(c v_+)$ then $r'(u) < r(u)$ and $r'(v) < r(v)$. If instead $a_1 \notin \mathrm{alph}(c u_+) = \mathrm{alph}(c v_+)$, we first consider the case where $r$ has a single alternation, i.e., $r = Y_{a_1} X_{b_1} \cdots X_{b_k}$. We have $r(u) < r'(u)$ if and only if $r$ is defined on $u_-$, and hence condensed (Example 3.3). Since $u_- \triangleright_{m,n} v_-$ (Lemma 3.7), this is the case if and only if $r$ is defined on $v_-$. Hence, if $r$ is defined on $u_-$, we have $r(u) < r'(u)$ and $r(v) < r'(v)$. If $r$ is not defined on $u_-$, but $Y_{a_1} X_{b_1} \cdots X_{b_{k-1}}$ is defined on $u_-$ and $b_k = c$, then the same holds for $v$ and we have $r(u) = r'(u)$ and $r(v) = r'(v)$. Otherwise, we have $r(u) > r'(u)$ and $r(v) > r'(v)$.

The last situation arises if $r$ is of the form $r = Y_{a_1} X_{b_1} \cdots X_{b_k} Y_d s''$. In particular, $m \ge 3$. If $Y_{a_1} X_{b_1} \cdots X_{b_k}$ is defined on $u_- c$, then it is defined on $v_- c$ as well (by the same reasoning as in the previous paragraph) and we have $r(u) < r'(u)$ and $r(v) < r'(v)$.

Similarly, if $Y_{a_1} X_{b_1} \cdots X_{b_{k-1}}$ is not defined on $u_-$ and $v_-$, then we have $r'(u) < r(u)$ and $r'(v) < r(v)$.

Finally, let us assume that $Y_{a_1} X_{b_1} \cdots X_{b_k}$ is not defined on $u_- c$ or $v_- c$, but $Y_{a_1} X_{b_1} \cdots X_{b_{k-1}}$ is defined on $u_-$ and $v_-$. Let $u_+ = u_0 b_k u_1$ and $v_+ = v_0 b_k v_1$ be $b_k$-left factorizations. Then

$$\mathrm{ord}(r(u), r'(u)) = \mathrm{ord}(Y_d s''(u_- c u_0), r'(u_- c u_0)) \quad\text{and}$$
$$\mathrm{ord}(r(v), r'(v)) = \mathrm{ord}(Y_d s''(v_- c v_0), r'(v_- c v_0)).$$

Since $u \triangleright_{m,n+1} v$, we have $u_+ \triangleright_{m,n} v_+$, and by Lemma 3.7, $u_- \triangleright_{m,n} v_-$ and $u_0 \triangleright_{m,n-1} v_0$. Therefore $u_- c u_0 \triangleright_{m,n-1} v_- c v_0$. Since $Y_d s'' \in \underline{R}^Y_{m-2,n-2}$ is condensed on both $u_- c u_0$ and $v_- c v_0$, we conclude by induction on the length of $r$ that $\mathrm{ord}(Y_d s''(u_- c u_0), r'(u_- c u_0)) = \mathrm{ord}(Y_d s''(v_- c v_0), r'(v_- c v_0))$ and hence $\mathrm{ord}(r(u), r'(u)) = \mathrm{ord}(r(v), r'(v))$.

This concludes the proof. □
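
The trichotomy used at the start of this proof can be tested mechanically. The sketch below assumes the usual ranker semantics of Weis and Immerman: positions are numbered from 1, $X_a$ moves to the next $a$ strictly to the right, $Y_a$ to the previous $a$ strictly to the left, and a ranker starting with $X$ (resp. $Y$) starts from position $0$ (resp. $|u|+1$). The function names and the encoding of rankers as lists of pairs are ours. The code checks, on all short words, that the order type of $(r(u), X_c(u))$ is determined by whether $rY_c$ is defined and by the last letter of $r$. Condensedness plays no role in this particular check, so it is not tested.

```python
from itertools import product


def eval_ranker(ranker, word):
    """Return the 1-based position r(word), or None if r is undefined.

    A ranker is a non-empty list of (direction, letter) pairs, with direction
    "X" (next occurrence to the right) or "Y" (previous one to the left).
    """
    pos = 0 if ranker[0][0] == "X" else len(word) + 1
    for direction, letter in ranker:
        if direction == "X":
            candidates = range(pos + 1, len(word) + 1)
        else:
            candidates = range(pos - 1, 0, -1)
        pos = next((i for i in candidates if word[i - 1] == letter), None)
        if pos is None:
            return None
    return pos


def order_type(p, q):
    """Order type of the pair (p, q), as one of '<', '=', '>'."""
    return "<" if p < q else ("=" if p == q else ">")


def predicted_order(r, c, word):
    """ord(r(u), X_c(u)) as predicted by the trichotomy on r Y_c."""
    if eval_ranker(r + [("Y", c)], word) is not None:
        return ">"  # r Y_c is defined: X_c(u) < r(u)
    if r[-1][1] == c:
        return "="  # r Y_c undefined and c is the last letter of r
    return "<"      # r Y_c undefined and c is not the last letter of r


def trichotomy_holds(alphabet="abc", max_word=5, max_ranker=3):
    """Check the prediction on all small words and rankers starting with Y."""
    steps = [(d, a) for d in "XY" for a in alphabet]
    for k in range(1, max_ranker + 1):
        for tail in product(steps, repeat=k - 1):
            for a in alphabet:
                r = [("Y", a), *tail]
                for n in range(max_word + 1):
                    for w in map("".join, product(alphabet, repeat=n)):
                        for c in alphabet:
                            p = eval_ranker(r, w)
                            q = eval_ranker([("X", c)], w)
                            if p is None or q is None:
                                continue  # only compare defined positions
                            if order_type(p, q) != predicted_order(r, c, w):
                                return False
    return True
```

Calling `trichotomy_holds()` runs the comparison over all words of length at most 5 on a three-letter alphabet and all rankers of length at most 3 that start with a $Y$-letter.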

Proposition 5.6. Let $n \ge m \ge 1$, let $r = X_a s \in \underline{R}^X_m$ and $r' = Y_b s' \in \underline{R}^Y_m$ such that $|r| + |r'| \le n$, and let $u, v \in A^*$ such that $r$ and $r'$ are condensed on $u$ and $v$. If $u \triangleright_{m+1,n} v$ or $u \triangleleft_{m+1,n} v$, then $\mathrm{ord}(r(u), r'(u)) = \mathrm{ord}(r(v), r'(v))$.

Proof. Without loss of generality, we assume that $u \triangleright_{m+1,n} v$. We proceed by induction, first on $m$. If $m = 1$, then $r = X_{a_1} \cdots X_{a_k}$ and $r' = Y_{b_1} \cdots Y_{b_\ell}$ with $k + \ell \le n$. We observe that if $p = r X_{b_\ell} \cdots X_{b_1}$ is defined on $u$, then $r(u) < r'(u)$; if $p$ is not defined on $u$, but $a_k = b_\ell$ and $r X_{b_{\ell-1}} \cdots X_{b_1}$ is defined on $u$, then $r(u) = r'(u)$; and in all other cases, $r(u) > r'(u)$. The same holds for $v$, and this completes the proof in case $m = 1$.

We now assume that $m \ge 2$ and proceed by induction on $n$. We first note that if one of $r$, $r'$ has length 1, then the result was established in Proposition 5.5. We now assume that $|r|, |r'| \ge 2$ (so $|r|, |r'| \le n - 2$).

Suppose that $n = m + 1$ and let $\beta(r)$ be the number of alternating blocks in $r$: then $\beta(r) \le |r| \le n - |r'| \le n - 2 = m - 1$. The same inequality holds for $r'$ and we conclude by induction on $m$.

We must now consider the case where $n > m + 1 > 2$. In particular, we have $r \in \underline{R}_{m,n-2}$ and $r' \in \underline{R}_{m,n-2}$.

First case: $s$ starts with an X-block. Let $u = u_- a u_+$ and $v = v_- a v_+$ be $a$-left factorizations. Then $s$ is condensed on $u_+$ and $v_+$ and $u_+ \triangleright_{m+1,n-1} v_+$, so $u_+$ and $v_+$ agree on rankers in $\underline{R}_{m,n-2}$ (Proposition 3.10). In particular, $u_+$ and $v_+$ agree on $r'$. If $r'$ is defined on $u_+$, then $\mathrm{ord}(r(u), r'(u)) = \mathrm{ord}(s(u_+), r'(u_+))$. Moreover, $r'$ is defined on $v_+$ as well and $\mathrm{ord}(r(v), r'(v)) = \mathrm{ord}(s(v_+), r'(v_+))$, so we conclude by induction. If instead $r'$ is not defined on $u_+$ or $v_+$, then $r'(u) < X_a(u) < r(u)$ and $r'(v) < X_a(v) < r(v)$.

Second case: $s'$ starts with a Y-block. Let $u = u_- b u_+$ and $v = v_- b v_+$ be $b$-right factorizations. Then $u_- \triangleright_{m+1,n-1} v_-$ by Lemma 3.7 and this case can be handled exactly like the previous one.

Third case: $s$ starts with a Y-block and $s'$ starts with an X-block. If $X_a(u) < Y_b(u)$, then $X_a(v) < Y_b(v)$ (by Proposition 5.5), we have $r(u) < X_a(u) < Y_b(u) < r'(u)$, and the same inequalities hold for $v$.

We now assume that $X_a(u) > Y_b(u)$ and $X_a(v) > Y_b(v)$. In particular, $a \ne b$. Identifying the first $a$ and the last $b$ in $u$ and $v$, we get factorizations $u = u_- b u_0 a u_+$ and $v = v_- b v_0 a v_+$ such that $a \notin \mathrm{alph}(u_- b u_0) \cup \mathrm{alph}(v_- b v_0)$ and $b \notin \mathrm{alph}(u_0 a u_+) \cup \mathrm{alph}(v_0 a v_+)$. In particular, $r(u) = s(u_- b u_0)$, $r'(u)$ is the position $s'(u_0 a u_+)$ in the suffix $u_0 a u_+$ of $u$, and the same holds in $v$. Moreover, $u = (u_- b u_0) a u_+$ is an $a$-left factorization, $u = u_- b (u_0 a u_+)$ is a $b$-right factorization, and the same holds in $v$. Therefore, and since $u \triangleright_{m+1,n} v$, we have $u_- b u_0 \triangleleft_{m,n-1} v_- b v_0$ by definition and $u_0 \triangleleft_{m,n-2} v_0$ by Lemma 3.7.

Since $s \in \underline{R}^Y_{m-1,n-3}$ and $s' \in \underline{R}^X_{m-1,n-3} \subseteq \underline{R}_{m,n-2}$, Proposition 3.10 shows that, if $s$ is not defined on $u_0$, then it is not defined on $v_0$ either, and $r(u) < Y_b(u) < r'(u)$ and similarly, $r(v) < r'(v)$. Symmetrically, if $s'$ is not defined on $u_0$, then $r(u) < X_a(u) < r'(u)$ and $r(v) < r'(v)$.

Finally, if $s$ and $s'$ are defined on $u_0$, then

$$\mathrm{ord}(r(u), r'(u)) = \mathrm{ord}(s(u_0), s'(u_0)) \quad\text{and}$$
$$\mathrm{ord}(r(v), r'(v)) = \mathrm{ord}(s(v_0), s'(v_0)),$$

and we conclude by induction. □



5.2.2. The case where r and r' start with the same direction. 

Proposition 5.7. Let $n \ge m \ge 2$, $r \in \underline{R}_{m,n}$ starting with an X-letter, and $r' = X_c$. If $u, v \in A^*$, $r$ is condensed on $u$ and $v$ and $u \triangleright_{m,n+1} v$, then $\mathrm{ord}(r(u), r'(u)) = \mathrm{ord}(r(v), r'(v))$. The dual statement (involving $r \in \underline{R}_{m,n}$ starting with a Y-letter, $r' = Y_c$, and $u \triangleleft_{m,n+1} v$) holds as well.

Proof. We proceed by induction, first on $m$. If $m = 2$, then either $r = X_{a_1} \cdots X_{a_k}$ or $r = X_{a_1} \cdots X_{a_k} Y_{b_1} \cdots Y_{b_\ell}$. In the first case, the order type $\mathrm{ord}(r(u), r'(u))$ depends, as in the proof of Proposition 5.6, on whether $X_c Y_{a_k} \cdots Y_{a_1}$ is defined on $u$, or if it is not defined, whether $a_k = c$ and $X_c Y_{a_{k-1}} \cdots Y_{a_1}$ is defined. Since these rankers are in $\underline{R}_{2,n+1}$ and condensed where they are defined (Example 3.3), we have $\mathrm{ord}(r(u), r'(u)) = \mathrm{ord}(r(v), r'(v))$.

In the second case, where $r = X_{a_1} \cdots X_{a_k} Y_{b_1} \cdots Y_{b_\ell}$, three cases arise: if $rY_c$ is defined on $u$, then $X_c(u) < r(u)$; if $rY_c$ is not defined and $c = b_\ell$, then $X_c(u) = r(u)$; in all other cases, $r(u) < X_c(u)$. Since $u \triangleright_{2,n+1} v$ and $rY_c \in \underline{R}_{2,n+1}$, Proposition 3.10 shows that $rY_c$ is defined on $u$ if and only if it is defined on $v$, and $\mathrm{ord}(r(u), r'(u)) = \mathrm{ord}(r(v), r'(v))$.

We now assume that $m \ge 3$. If $r$ has less than $m$ alternating blocks, we conclude by induction on $m$. Let us suppose now that $r$ has $m$ alternating blocks and let us proceed by induction on $|r| \ge m$.

Let $r = X_a s$ (and let $u = u_- a u_+$ and $v = v_- a v_+$ be the $a$-left factorizations). If $s$ starts with a Y-letter (which includes the base case where $|r| = m$), then $s \in \underline{R}^Y_{m-1,n-1}$ is condensed on $u_-$ and $v_-$. If $c \notin \mathrm{alph}(u_-) = \mathrm{alph}(v_-)$, then $r(u) < r'(u)$ and $r(v) < r'(v)$. In all other cases,

$$\mathrm{ord}(r(u), r'(u)) = \mathrm{ord}(s(u_-), r'(u_-)) \quad\text{and}$$
$$\mathrm{ord}(r(v), r'(v)) = \mathrm{ord}(s(v_-), r'(v_-)).$$

Since $u_- \triangleright_{m,n} v_-$ by Lemma 3.7, these two order types are equal by Proposition 5.5.

If instead $s$ starts with an X-letter, then $|r| > m$, $s \in \underline{R}^X_{m,n-1}$ is condensed on $u_+$ and $v_+$ (Lemma 3.4) and we distinguish two cases. If $c \in \mathrm{alph}(u_- a) = \mathrm{alph}(v_- a)$, then $r'(u) < r(u)$ and $r'(v) < r(v)$. Otherwise

$$\mathrm{ord}(r(u), r'(u)) = \mathrm{ord}(s(u_+), r'(u_+)) \quad\text{and}$$
$$\mathrm{ord}(r(v), r'(v)) = \mathrm{ord}(s(v_+), r'(v_+)).$$

Since $u_+ \triangleright_{m,n} v_+$, these two order types are equal by induction on $n$. □

Proposition 5.8. Let $n \ge m \ge 2$, let $r = X_a s \in \underline{R}^X_m$ and $r' = X_b s' \in \underline{R}^X_{m-1}$ such that $|r| + |r'| \le n$, and let $u, v \in A^*$ such that $r$ and $r'$ are condensed on $u$ and $v$. If $u \triangleright_{m,n} v$, then $\mathrm{ord}(r(u), r'(u)) = \mathrm{ord}(r(v), r'(v))$. The dual statement (where $r, r'$ start with Y-blocks and $u \triangleleft_{m,n} v$) holds as well.

Proof. The proof is by induction on $m$, and then on $n$. If one of $r$ and $r'$ has length 1, then the result was established in Proposition 5.7. This takes care of the cases where $n \le 3$, including the base case $m = n = 2$. We now assume that $|r|, |r'| \ge 2$.

Let us observe that under this assumption, if $n = m$, then the number of alternating blocks in $r$ is less than or equal to $m - 2$: indeed it is at most equal to $|r| \le n - 2 = m - 2$. The same inequality holds for $r'$, so this situation is handled by induction on $m$. We can now assume that $n > m$.

Let $u = u_- a u_+ = u'_- b u'_+$ and $v = v_- a v_+ = v'_- b v'_+$ be $a$-left and $b$-left factorizations.

First case: $a = b$. If $s$ starts with an X-block and $s'$ starts with a Y-block, then $r'(u) < r(u)$ and $r'(v) < r(v)$. Dually, if $s$ starts with a Y-block and $s'$ starts with an X-block, then $r'(u) > r(u)$ and $r'(v) > r(v)$.

If $s$ and $s'$ both start with a Y-block (which can happen only if $m - 1 \ge 2$), then $s \in \underline{R}^Y_{m-1}$ and $s' \in \underline{R}^Y_{m-2}$ are condensed on $u_-$ and $v_-$, and

$$\mathrm{ord}(r(u), r'(u)) = \mathrm{ord}(s(u_-), s'(u_-)) \quad\text{and}$$
$$\mathrm{ord}(r(v), r'(v)) = \mathrm{ord}(s(v_-), s'(v_-)).$$

Since $u_- \triangleleft_{m-1,n-1} v_-$ and $|s| + |s'| \le n - 2$, we have $\mathrm{ord}(r(u), r'(u)) = \mathrm{ord}(r(v), r'(v))$ by induction on $m$.

If instead $s$ and $s'$ both start with an X-block, then $s \in \underline{R}^X_m$ and $s' \in \underline{R}^X_{m-1}$ are condensed on $u_+$ and $v_+$, and we have

$$\mathrm{ord}(r(u), r'(u)) = \mathrm{ord}(s(u_+), s'(u_+)) \quad\text{and}$$
$$\mathrm{ord}(r(v), r'(v)) = \mathrm{ord}(s(v_+), s'(v_+)).$$

Since $u_+ \triangleright_{m,n-1} v_+$ and $|s| + |s'| \le n - 2$, we have $\mathrm{ord}(r(u), r'(u)) = \mathrm{ord}(r(v), r'(v))$ by induction on $n$.

Second case: $a \ne b$, $s$ and $s'$ start with X-blocks. Then $s \in \underline{R}^X_m$ and $s' \in \underline{R}^X_{m-1}$ are condensed on $u_+$ and $v_+$. Without loss of generality, $X_b(u) < X_a(u)$, so we have $r(u) = X_a s(u) = X_b X_a s(u) = X_b r(u)$. In particular, $\mathrm{ord}(r(u), r'(u)) = \mathrm{ord}(r(u'_+), s'(u'_+))$. By Proposition 5.7, we also have $X_b(v) < X_a(v)$, and hence $\mathrm{ord}(r(v), r'(v)) = \mathrm{ord}(r(v'_+), s'(v'_+))$. Since $u \triangleright_{m,n} v$, we have $u'_+ \triangleright_{m,n-1} v'_+$ and we conclude by induction on $n$ since $|r| + |s'| \le n - 1$.

Third case: $a \ne b$, $s$ and $s'$ start with Y-blocks. This can occur only if $m - 1 \ge 2$. Then $s \in \underline{R}^Y_{m-1}$ and $s' \in \underline{R}^Y_{m-2}$ are condensed on $u_-$ and $v_-$, $r(u) = s(u_-)$ and $r'(u) = s'(u'_-)$, and the same equalities hold for $v$. Without loss of generality, we may assume that $X_b(u) < X_a(u)$, and hence $X_b(v) < X_a(v)$ (Proposition 5.7). Let $u_0$ and $v_0$ be such that $u = u'_- b u_0 a u_+$ and $v = v'_- b v_0 a v_+$: then $u_0$ is the left factor in the $a$-left decomposition of $u'_+$ and the right factor in the $b$-left decomposition of $u_-$. An analogous statement is true for $v_0$. There are two cases, depending on whether $s$ is defined on $b u_0$. If this is the case, then $r'(u) < r(u)$. Moreover, we have $u'_+ \triangleright_{m,n-1} v'_+$ and $u_0 \triangleleft_{m-1,n-2} v_0$, so $s$ is defined on $b v_0$ as well, by Proposition 3.10.

If instead $s$ is not defined on $b u_0$ or $b v_0$, let $p$ be the longest prefix of $s$ which is defined on $b u_0$ (and hence on $b v_0$): then $p$ is either empty or a Y-block and $s = p Y_c t$, where $c$ has no occurrence in $u[X_b(u); X_a p(u) - 1]$ (so $Y_c$ is defined on $u'_-$).

If $Y_c t$ is defined on $u'_-$, then $r(u) = s(u_-) = Y_c t(u'_-)$, so that

$$\mathrm{ord}(r(u), r'(u)) = \mathrm{ord}(Y_c t(u'_-), s'(u'_-)).$$

Now $u \triangleright_{m,n} v$ implies $u'_- \triangleright_{m,n-1} v'_-$ by Lemma 3.7, so $Y_c t$ is defined on $v'_-$ and hence we have $\mathrm{ord}(r(v), r'(v)) = \mathrm{ord}(Y_c t(v'_-), s'(v'_-))$ as well. Since $|Y_c t| \le |s| < |r|$, we conclude by induction that $\mathrm{ord}(r(u), r'(u)) = \mathrm{ord}(r(v), r'(v))$.

If $Y_c t$ is not defined on $u'_-$, then let $Y_c q$ be the longest prefix of $Y_c t$ which is defined on $u'_-$ (and hence on $v'_-$). Then $q$ is either empty or an X-block and $Y_c t = Y_c q X_d t'$. If $d = b$, then $q X_d(u'_- b) = X_b(u)$, so $r(u) = X_b t'(u)$ and similarly, $r(v) = X_b t'(v)$. We conclude by induction on $m$ that $\mathrm{ord}(r(u), r'(u)) = \mathrm{ord}(r(v), r'(v))$ since $X_b t'$ has 2 blocks less than $r$.

If $d \ne b$, then we have $X_b(u) < X_a p Y_c q X_d(u)$. If $X_d t'$ is defined on $b u_0$, then $r(u)$ lies in $u_0$ and $r'(u)$ lies in $u'_-$, so $r(u) > r'(u)$. Similarly $r(v) > r'(v)$, and we are done. If instead $X_d t'$ is not defined on $b u_0$, then $X_a p Y_c q(u) < X_b(u)$ and $X_a p Y_c q X_d(u) = X_b X_d(u)$, so the condensedness of $r = X_a p Y_c q X_d t'$ on $u$ implies that $X_b X_d t'$ is condensed on $u$ as well. The same holds for $v$, and we have

$$\mathrm{ord}(r(u), r'(u)) = \mathrm{ord}(X_b X_d t'(u), r'(u)) \quad\text{and similarly,}$$
$$\mathrm{ord}(r(v), r'(v)) = \mathrm{ord}(X_b X_d t'(v), r'(v)).$$

We conclude by induction on $m$ since $X_b X_d t'$ has 2 blocks less than $r$.

Fourth case: $a \ne b$, $s$ and $s'$ start with different directions. Without loss of generality, we may assume that $s$ starts with an X-block and $s'$ starts with a Y-block. Since $r$ starts with 2 X-letters, the number of alternating blocks of $r$ is at most $|r| - 1 \le n - 3$. Therefore if $n = m + 1$, $r \in \underline{R}_{m-2}$ and $r' \in \underline{R}_{m-1}$, a case that can be decided by induction on $m$. So we now assume that $n \ge m + 2$.

If $X_b(u) < X_a(u)$, then the same inequality holds in $v$ (by Proposition 5.7) and we have $r'(u) < r(u)$ and $r'(v) < r(v)$. If instead $X_a(u) < X_b(u)$ and $X_a(v) < X_b(v)$, then the $b$-left factorizations of $u_+$ and $v_+$ are of the form $u_+ = u_0 b u'_+$ and $v_+ = v_0 b v'_+$.

Several cases arise, according to whether $s$ and $s'$ are defined (and condensed) on $u_0$ or not. We have $u_+ \triangleright_{m,n-1} v_+$ and $u_0 \triangleright_{m,n-2} v_0$ by Lemma 3.7. It follows as usual that $s$ and $s'$ are defined on $v_0$ if and only if they are defined on $u_0$. If $s$ is not defined on $u_0$ then the order types $\mathrm{ord}(r(u), r'(u))$ and $\mathrm{ord}(r(v), r'(v))$ are both $>$. Therefore, from now on we can assume that $s$ is defined on $u_0$ and $v_0$.

If $s'$ is defined on $u_0$ then we can chop off $u_- a$ from $u$, $v_- a$ from $v$, and $X_a$ from $r$: $\mathrm{ord}(r(u), r'(u)) = \mathrm{ord}(s(u_+), r'(u_+))$ and $\mathrm{ord}(r(v), r'(v)) = \mathrm{ord}(s(v_+), r'(v_+))$. Since $u_+ \triangleright_{m,n-1} v_+$, $\mathrm{ord}(s(u_+), r'(u_+))$ and $\mathrm{ord}(s(v_+), r'(v_+))$ are equal by induction on $n$, and hence $\mathrm{ord}(r(u), r'(u)) = \mathrm{ord}(r(v), r'(v))$.

If $s'$ is not defined on $u_0$, then, as in the third case, we have to split the ranker $s'$ at those points at which it crosses the position $X_a(u)$. Let $X_b s' = p_1 q_1 \cdots p_k q_k$ such that all $p_i$ are defined on $u_0$ and all $p_i$ start with an X-letter followed by a (possibly empty) Y-block. The sole exception is $p_k$, which might contain further blocks. Moreover, each $p_i$ is the maximal prefix of $p_i q_i \cdots p_k q_k$ which is defined on $u_0$. All $q_i$ are defined on $u_- a$ and all $q_i$ start with a Y-letter followed by a (possibly empty) X-block. The sole exception is $q_k$, which might be empty or might contain further blocks. Each $q_i$ is the maximal prefix of $q_i p_{i+1} \cdots p_k q_k$ which is defined on $u_- a$. Since $u_- a \triangleright_{m,n-1} v_- a$ (Lemma 3.7) and $u_0 \triangleleft_{m-1,n-2} v_0$, the same definedness and maximality properties hold on $v_- a$ and $v_0$.

If $q_k$ is empty, then $k \ge 2$ and $p_1$ and $q_1$ are non-empty. We see that $\mathrm{ord}(r(u), r'(u)) = \mathrm{ord}(s(u_0), p_k(u_0))$ and $\mathrm{ord}(r(v), r'(v)) = \mathrm{ord}(s(v_0), p_k(v_0))$. By induction on $n$, we have $\mathrm{ord}(s(u_0), p_k(u_0)) = \mathrm{ord}(s(v_0), p_k(v_0))$, and hence $\mathrm{ord}(r(u), r'(u)) = \mathrm{ord}(r(v), r'(v))$.

Finally, if $q_k$ is non-empty, then we have $r(u) > r'(u)$ and $r(v) > r'(v)$. □

5.2.3. Completing the proof of Theorem 5.1. Let us (at last!) verify that, if $u \triangleright_{m+1,2n} v$ or $u \triangleleft_{m+1,2n} v$, then Properties (WI 2c) and (WI 3c) hold for $m, n$. By symmetry, we simply handle the case where $u \triangleright_{m+1,2n} v$.

To verify Property (WI 2c), we consider rankers $r \in \underline{R}_{m,n}$ and $r' \in \underline{R}_{m-1,n-1}$ that are condensed on $u$ and $v$. If both start with X-blocks, Proposition 5.8 shows that $\mathrm{ord}(r(u), r'(u))$ and $\mathrm{ord}(r(v), r'(v))$ coincide. If both start with Y-blocks, the same proposition allows us to conclude, after observing that we have $u \triangleleft_{m,2n-1} v$. And if $r$ and $r'$ start with different direction blocks, we conclude by Proposition 5.6.

To verify Property (WI 3c), we consider rankers $r \in \underline{R}_{m,n}$ and $r' \in \underline{R}_{m,n-1}$ that end with different directions, and that are condensed on $u$ and $v$. If $r$ and $r'$ start with different direction blocks, we again conclude by Proposition 5.6. If both start with X-blocks, then they must have different numbers of alternations, so we have $r \in R^X_{m_1,n_1}$, $r' \in R^X_{m_2,n_2}$ for some $n_1 \le n$, $n_2 \le n - 1$ and for distinct values $m_1, m_2 \le m$. In particular, one of $m_1$ and $m_2$ is less than or equal to $m - 1$, and we can apply Proposition 5.8.

We proceed similarly if $r$ and $r'$ both start with Y-blocks, after observing that $u \triangleleft_{m,2n-1} v$. This completes the proof of Theorem 5.1.

6. Consequences 

6.1. Decidability results. The main consequence we draw of Theorem 5.1 and of the decidability of the pseudovarieties $\mathbf{R}_m$ and $\mathbf{L}_m$ is summarized in the next statement.

Theorem 6.1. Given an $\mathrm{FO}^2$-definable language $L$, one can compute an integer $m$ such that $L$ is $\mathrm{FO}^2_{m+1}$-definable, possibly $\mathrm{FO}^2_m$-definable, but not $\mathrm{FO}^2_{m-1}$-definable. That is: we can decide the quantifier alternation level of $L$ within one unit.

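Proof. Let $L \in \mathcal{FO}^2$ and let $M$ be its syntactic monoid. Since each pseudovariety $\mathbf{R}_m \cap \mathbf{L}_m$ is decidable (Proposition 3.23), we can compute the largest $m$ such that $M \notin \mathbf{R}_m \cap \mathbf{L}_m$. By Theorem 5.1, $M \in \mathbf{R}_{m+1} \cap \mathbf{L}_{m+1} \subseteq \mathbf{FO}^2_{m+1}$ and hence $L$ is $\mathrm{FO}^2_{m+1}$-definable. On the other hand, $M \notin \mathbf{FO}^2_{m-1}$, since $\mathbf{FO}^2_{m-1} \subseteq \mathbf{R}_m \cap \mathbf{L}_m$. □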
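
The procedure in this proof can be sketched as a simple search. In the Python sketch below, `in_R_and_L` is a hypothetical oracle standing for the decision procedure of Proposition 3.23 (it is not implemented here), and the bound $|A| + 1$ on the search comes from the fact, recalled in Section 6.2, that an $|A|$-generated element of $\mathbf{DA}$ lies in $\mathbf{R}_{|A|+1} \cap \mathbf{L}_{|A|+1}$. Returning 0 when $M$ already lies in $\mathbf{R}_1 \cap \mathbf{L}_1$ is a convention of ours.

```python
from typing import Callable


def alternation_level(in_R_and_L: Callable[[int], bool], alphabet_size: int) -> int:
    """Return the largest m >= 1 such that in_R_and_L(m) is False, or 0 if none.

    in_R_and_L(m) is a hypothetical oracle deciding whether the syntactic
    monoid M lies in R_m ∩ L_m. It is assumed to be monotone in m, and true
    for m = alphabet_size + 1 whenever the language is FO2-definable.
    """
    for m in range(1, alphabet_size + 2):
        if in_R_and_L(m):
            # First level containing M: the previous one is the largest
            # level that does not contain M.
            return m - 1
    raise ValueError("oracle never accepted: the language is not FO2-definable")
```

With the returned value $m$, the language is $\mathrm{FO}^2_{m+1}$-definable and not $\mathrm{FO}^2_{m-1}$-definable, as in Theorem 6.1. For instance, a toy oracle accepting exactly the levels $m \ge 3$ yields the value 2.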

Let us also record the following consequences of Proposition 3.23, Proposition 4.1 and the decidability of $\mathbf{R}_2 \vee \mathbf{L}_2$ (discussed in Example 5.3).

Proposition 6.2. The classes $\mathcal{TL}_1 = \mathcal{TL}^X_1 = \mathcal{TL}^Y_1 = \mathcal{FO}^2_1$, $\mathcal{TL}^X_m$, $\mathcal{TL}^Y_m$ and $\mathcal{TL}_2$ are decidable.

6.2. Infinite and collapsing hierarchies. The fact that the $\mathbf{R}_m$ and $\mathbf{L}_m$ form strict hierarchies (Proposition 3.23), together with Theorem 5.1, proves that the $\mathcal{FO}^2_m$ hierarchy is infinite. Weis and Immerman had already proved this result by combinatorial means [37, Theorem 4.11], whereas our proof is algebraic. From that result on the $\mathcal{FO}^2_m$ hierarchy, it is also possible to recover the strict hierarchy result on the $\mathbf{R}_m$ and $\mathbf{L}_m$ and the fact that their union is equal to $\mathbf{DA}$.

By the same token, Corollary 3.12 and Theorem 4.3 show that the $\mathcal{TL}_m$ (resp. $\mathbf{TL}_m$) hierarchy is infinite and that its union is all of $\mathcal{FO}^2$ (resp. $\mathbf{DA}$).

Theorem 6.3. The hierarchies $\mathcal{FO}^2_m$ and $\mathcal{TL}_m$ are infinite, and their union is all of $\mathcal{FO}^2$.

Similarly, the fact (stated in Proposition 3.23) that an $m$-generated element of $\mathbf{DA}$ lies in $\mathbf{R}_{m+1} \cap \mathbf{L}_{m+1}$, shows that an $\mathrm{FO}^2$-definable language in $A^*$ lies in $\mathcal{R}_{|A|+1} \cap \mathcal{L}_{|A|+1}$, and hence in $\mathcal{FO}^2_{|A|+1}$ - a fact that was already established by combinatorial means by Weis
and Immerman [37' Theorem 4.7]. It also shows that such a language is in T^2\A\+i by 
Theorem 14. 3[ 

Theorem 6.4. A language $L \subseteq A^*$ is $\mathrm{FO}^2$-definable if and only if it is $\mathrm{FO}^2_{|A|+1}$-definable.
And it is TL-definable if and only if it is both TL^^j | ^ and TL^^j | ^-definable. 

Even though we arrived at Theorem 6.4 by algebraic means, it is interesting to note that its statement reflects the following combinatorial property (an idea that was already used by Weis and Immerman [37, Theorem 4.7]).

Lemma 6.5. A ranker that is condensed on a word on alphabet $A$ has at most $|A|$ alternating blocks.

Proof. Let $u$ be a word and let $r$ be a ranker that is condensed on $u$. Without loss of generality, we may assume that $r \in R^X_{m,n}$, say

$$r = X_{a_1} \cdots X_{a_{k_1}} Y_{a_{k_1+1}} \cdots Y_{a_{k_2}} \cdots Z_{a_{k_{m-1}+1}} \cdots Z_{a_{k_m}}$$

with $0 < k_1 < k_2 < \cdots < k_m = n$ and $Z = X$ (resp. $Y$) if $m$ is odd (resp. even). By definition of condensed rankers (and with the notation in that definition, see Section 3), the interval $I_{k_h}$ is of the form $(i_{k_h - 1}; X_{a_{k_h}}(u, i_{k_h - 1}))$ if $h$ is odd, of the form $(Y_{a_{k_h}}(u, j_{k_h - 1}); j_{k_h - 1})$ if $h$ is even. In either case, $a_{k_{h+1}}$ occurs in $u$ within the interval $I_{k_h}$ but $a_{k_h}$ does not. Since the intervals $I_{k_h}$ are nested, it follows that the letters $a_{k_1}, a_{k_2}, \ldots, a_{k_m}$ are pairwise distinct, and hence $m \le |A|$. □



6.3. Infinite hierarchies and unambiguous polynomials. Finally we note the following refinement of [15, Proposition 4.6]. One of the classical (and one of the earliest) results concerning the languages recognized by monoids in $\mathbf{DA}$ is the following: they are exactly the disjoint unions of unambiguous products of the form $B_0^* a_1 B_1^* \cdots a_k B_k^*$, where each $B_i$ is a subset of $A$ (Schützenberger [25], see also [31, 32, 6]). Recall that such a product is unambiguous if each word $w \in B_0^* a_1 B_1^* \cdots a_k B_k^*$ factors in a unique way as $w = u_0 a_1 u_1 \cdots a_k u_k$ with $u_i \in B_i^*$. Deterministic and co-deterministic products (see Section 3.4) are easily seen to be particular cases of unambiguous products. Propositions 3.21 and 3.25 imply the following statement.

Proposition 6.6. The least variety of languages containing the languages of the form $B^*$ ($B \subseteq A$) and closed under visibly deterministic and visibly co-deterministic products, is $\mathcal{FO}^2$.

More precisely, every unambiguous product of the form $B_0^* a_1 B_1^* \cdots a_k B_k^*$, where each $B_i$ is a subset of $A$, can be expressed in terms of Boolean operations and at most $|A| + 1$ alternated applications of visibly deterministic and visibly co-deterministic products - starting with a visibly deterministic (resp. co-deterministic) product.
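
Unambiguity of a product $B_0^* a_1 B_1^* \cdots a_k B_k^*$, as recalled at the start of Section 6.3, can be explored on short words by counting factorizations. The Python sketch below (function names are ours) counts, by dynamic programming, the ways to write $w = u_0 a_1 u_1 \cdots a_k u_k$ with $u_i \in B_i^*$. A product is unambiguous exactly when this count never exceeds 1; the second function tests this only up to a given length, so a positive answer is evidence rather than proof.

```python
from itertools import product


def count_factorizations(word, blocks, letters):
    """Count the ways to write word = u0 a1 u1 ... ak uk with ui in blocks[i]*.

    blocks is a list of k + 1 sets of letters, letters is a list of k letters.
    """
    k = len(letters)
    assert len(blocks) == k + 1
    # ways[i] = number of ways to parse the prefix read so far as
    # u0 a1 ... ai ui, with ui possibly still to be extended.
    ways = [1] + [0] * k
    for ch in word:
        new = [0] * (k + 1)
        for i, count in enumerate(ways):
            if ch in blocks[i]:
                new[i] += count      # ch extends the current factor ui
            if i < k and ch == letters[i]:
                new[i + 1] += count  # ch is read as the marker a_{i+1}
        ways = new
    return ways[k]


def is_unambiguous_up_to(blocks, letters, alphabet, max_len):
    """Check that no word of length <= max_len has two factorizations."""
    return all(
        count_factorizations("".join(w), blocks, letters) <= 1
        for n in range(max_len + 1)
        for w in product(alphabet, repeat=n)
    )
```

For example, with $A = \{a, b\}$ the product $A^* a A^*$ gives two factorizations of the word $aa$, whereas $\{b\}^* a \{a,b\}^*$, where the marker $a$ does not occur in $B_0$, gives at most one factorization of each word tested.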

The analogous, but weaker statement with the word "visibly" deleted was proved in [15] by algebraic means, and independently by Lodaya, Pandya and Shah using logical and combinatorial arguments [17].

Conclusion. We have related the $\mathcal{FO}^2_m$ hierarchy with the $\mathcal{R}_m$-$\mathcal{L}_m$ hierarchy, a hierarchy of varieties of languages which is connected with the alternation of closures under deterministic and co-deterministic products.

The varieties $\mathcal{R}_m$ and $\mathcal{L}_m$ are decidable, but the link we establish with $\mathcal{FO}^2_m$ (Theorem 5.1) is not tight enough to prove decidability of the quantifier alternation hierarchy. We remind the reader of our conjecture (Conjecture 5.4 above), according to which $\mathcal{FO}^2_m$ is equal to the intersection $\mathcal{R}_{m+1} \cap \mathcal{L}_{m+1}$. Establishing this conjecture would prove that each level of the quantifier alternation hierarchy $\mathcal{FO}^2_m$ is decidable.

Finally, we refer the reader to Straubing's result: he showed [28] that the pseudovariety $\mathbf{FO}^2_m$ is the $m$-th weakly iterated power of the pseudovariety $\mathbf{J}$ of $\mathcal{J}$-trivial monoids (more precisely, $\mathbf{FO}^2_1 = \mathbf{J}$ and $\mathbf{FO}^2_{m+1} = \mathbf{FO}^2_m \mathbin{\square} \mathbf{J}$). This result offers a different avenue to solve the decidability problem for $\mathrm{FO}^2_m$-definability, and our conjecture would show the equality between two algebraic hierarchies which seem completely unrelated.